<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-9-166</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Fast NJ-like algorithms to deal with incomplete distance matrices</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Criscuolo</snm>
               <fnm>Alexis</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>criscuol@lirmm.fr</email>
            </au>
            <au id="A2" ca="yes">
               <snm>Gascuel</snm>
               <fnm>Olivier</fnm>
               <insr iid="I1"/>
               <email>gascuel@lirmm.fr</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Equipe M&#233;thodes et Algorithmes pour la Bioinformatique, LIRMM, CNRS &#8211; Universit&#233; Montpellier 2, 161 rue Ada, 34392 Montpellier Cedex 05, France</p>
            </ins>
            <ins id="I2">
               <p>Groupe Phylog&#233;nie Mol&#233;culaire, ISEM, CNRS &#8211; Universit&#233; Montpellier 2, C.C. 064, 34095 Montpellier Cedex 05, France</p>
            </ins>
            <ins id="I3">
               <p>Equipe Bioinformatique Th&#233;orique, LSIIT, Universit&#233; Louis Pasteur, Strasbourg 1, P&#244;le API, Boulevard S&#233;bastien Brant, BP 10413, 67412 Illkirch Cedex, France</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>1</issue>
         <fpage>166</fpage>
         <url>http://www.biomedcentral.com/1471-2105/9/166</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18366787</pubid>
               <pubid idtype="doi">10.1186/1471-2105-9-166</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>25</day>
               <month>10</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>26</day>
               <month>3</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>26</day>
               <month>3</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Criscuolo and Gascuel; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Distance-based phylogeny inference methods first estimate evolutionary distances between every pair of taxa, then build a tree from the so-obtained distance matrix. These methods are fast and fairly accurate. However, they hardly deal with incomplete distance matrices. Such matrices are frequent with recent multi-gene studies, when two species do not share any gene in analyzed data. The few existing algorithms to infer trees with satisfying accuracy from incomplete distance matrices have time complexity in <it>O</it>(<it>n</it><sup>4</sup>) or more, where <it>n </it>is the number of taxa, which precludes large scale studies. Agglomerative distance algorithms (e.g. NJ <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>) are much faster, with time complexity in <it>O</it>(<it>n</it><sup>3</sup>) which allows huge datasets and heavy bootstrap analyses to be dealt with. These algorithms proceed in three steps: (a) search for the taxon pair to be agglomerated, (b) estimate the lengths of the two so-created branches, (c) reduce the distance matrix and return to (a) until the tree is fully resolved. But available agglomerative algorithms cannot deal with incomplete matrices.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We propose an adaptation to incomplete matrices of three agglomerative algorithms, namely NJ, BIONJ <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> and MVR <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. Our adaptation generalizes to incomplete matrices the taxon pair selection criterion of NJ (also used by BIONJ and MVR), and combines this generalized criterion with that of ADDTREE <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Steps (b) and (c) are also modified, but <it>O</it>(<it>n</it><sup>3</sup>) time complexity is kept. The performance of these new algorithms is studied with large scale simulations, which mimic multi-gene phylogenomic datasets. Our new algorithms &#8211; named NJ*, BIONJ* and MVR* &#8211; infer phylogenetic trees that are as least as accurate as those inferred by other available methods, but with much faster running times. MVR* presents the best overall performance. This algorithm accounts for the variance of the pairwise evolutionary distance estimates, and is well suited for multi-gene studies where some distances are accurately estimated using numerous genes, whereas others are poorly estimated (or not estimated) due to the low number (absence) of sequenced genes being shared by both species.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Our distance-based agglomerative algorithms NJ*, BIONJ* and MVR* are fast and accurate, and should be quite useful for large scale phylogenomic studies. When combined with the SDM method <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> to estimate a distance matrix from multiple genes, they offer a relevant alternative to usual supertree techniques <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Binaries and all simulated data are downloadable from <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Phylogeny inference methods can be classified into two main categories: character-based (e.g. maximum-parsimony or maximum-likelihood) and distance-based approaches. The latter have low running times which are quite useful (mandatory in some cases) to perform large-scale studies and bootstrap analyses. A number of computer simulations <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp> have shown that distance methods are fairly accurate, though not as accurate as likelihood-based methods that are much more time consuming. Using any distance-based method first requires to estimate the pairwise evolutionary distances between every taxon pair. These distances are usually estimated from DNA, RNA or protein sequences, but can also be obtained from DNA-DNA hybridization experiments or, e.g., computed from morphological data (see <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> for a review on distance estimation from various data types).</p>
         <p>In the last few years, phylogenomic studies (i.e. phylogeny reconstruction from large gene collections <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>) have instigated to the development of fast tree-building techniques being able to infer trees from datasets comprising hundreds of genes and taxa. The <it>low-level gene combination </it>involves concatenating the different genes into a unique <it>supermatrix of characters</it>, and then analyzing this matrix with a standard tree building method. This approach was shown to perform poorly when combined with distance methods, due to inaccurate distance estimations from such large heterogeneous character matrix <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Better distance-based trees are obtained by extracting the phylogenetic information from each gene separately, and then combining resulting information sources into a unique <it>distance supermatrix</it>. The <it>Average Consensus Supertree </it>(ACS <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>) and <it>Super Distance Matrix </it>(SDM <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>) techniques input a collection of distance matrices being estimated from each gene separately (the so-called <it>medium-level combination</it>), or being equivalent to the gene trees (the <it>high-level combination</it>). These distance matrices are deformed, without modifying their topological message, and then averaged to obtain the distance supermatrix, which is finally analyzed using a distance-based tree building algorithm.</p>
         <p>Estimating the distance supermatrix is fast. However, missing entries may occur in distance supermatrices depending on the extent of taxon overlap within the source matrices. For example, with the two large data sets of Driskell et al. <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, which were collected from Swiss-Prot and Gen-Bank thanks to a computer program, the ratio of missing distances is ~19% and ~1.2%, respectively. These distances are missing because only a few genes are sequenced within each species, meaning that a number of species pairs do not share any sequenced gene in common and cannot be compared using available data. However, Driskell et al. showed that, despite the sparseness of data and the fact that only a small subset of these data is potentially phylogenetically informative, a topological signal still emerges, which provides useful insights into the tree of life (see <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> and below for details). Analogous findings were reported by a number of authors in various contexts <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>, and tree building from sparse data has become topical, as can be seen from the flourishing literature on supertrees.</p>
         <p>However, tree building from incomplete distance matrices is NP-hard <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, and thus practical algorithms are heuristics. The indirect approach involves first estimating missing distances by applying an ultrametric <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, additive <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, decomposition-based <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>, or quartet-based <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> completion algorithm. The TREX package <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> provides several implementations of such algorithms to be used before tree building using any standard method with the completed matrix. The direct approach involves using a weighted least-squares (WLS) algorithm and associating missing distances with null weight (i.e. infinite variance), which means that missing distances are simply discarded from WLS computations (<abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, pp. 449). The FITCH algorithm <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> from the PHYLIP package <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> and the MWMODIF algorithm <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> from TREX implement this technique. A combination of both direct and indirect methods is provided by MW* <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> (also available in TREX); this algorithm first applies an ultrametric or additive completion algorithm (depending on the density of missing distances) and then infers a tree using MWMODIF, where weights are set to 1.0 for known distances, 0.5 for estimated distances, and 0.0 for missing distances (if any remain). All these (direct or indirect) algorithms have <it>O</it>(<it>n</it><sup>4</sup>) time complexity or more, where <it>n </it>is the number of taxa. This limits their application to medium-sized datasets (say 200 taxa without bootstrapping, see below).</p>
         <p>Agglomerative algorithms are much faster and allow dealing with thousands of taxa, as soon as the distance matrix is complete. The most popular of them is the Neighbor-Joining (NJ) algorithm <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Starting from a star tree, agglomerative algorithms iteratively perform the three following steps, until the tree is completely resolved:</p>
         <p>(a) select a taxon pair <it>xy </it>that is agglomerated into a new node <it>u</it>;</p>
         <p>(b) estimate the length of the two so-created external branches <it>ux </it>and <it>uy</it>;</p>
         <p>(c) replace <it>x </it>and <it>y </it>by <it>u </it>in the distance matrix, and estimate the new distances between <it>u </it>and the not-yet-agglomerated taxa.</p>
         <p>Step (a) is more time consuming than the two other steps, because of testing all the <it>O</it>(<it>n</it><sup>2</sup>) taxon pairs to select the optimal one. To this purpose, NJ optimizes a numerical criterion that is denoted as <it>Q</it><sub><it>xy</it></sub>. This criterion admits several interpretations related to the Minimum Evolution principle <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B34">34</abbr></abbrgrp>, but also to the acentrality of the considered pair <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. In this last interpretation (used here), <it>Q</it><sub><it>xy </it></sub>measures how much the path joining <it>x </it>to <it>y </it>is far from the other taxa <it>i </it>&#8800; <it>x</it>, <it>y</it>. The <it>xy </it>pair maximizing <it>Q</it><sub><it>xy </it></sub>corresponds to the two taxa which are most distant from the other ones and is the best candidate for agglomeration. Another criterion, denoted as <it>N</it><sub><it>xy</it></sub>, is used by ADDTREE <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>; this second criterion is based on the four point condition <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp> and counts the number of taxon quartets <it>xyij </it>where <it>x </it>and <it>y </it>are neighbors. When the distance matrix exactly corresponds to a tree (it is then said to be <it>additive</it>), <it>N</it><sub><it>xy </it></sub>indicates all pairs of sibling taxa in the tree, whereas <it>Q</it><sub><it>xy </it></sub>indicates just one such taxon pair. We shall see that this property of <it>N</it><sub><it>xy </it></sub>is a great advantage when dealing with incomplete distance matrices. Indeed, <it>Q</it><sub><it>xy </it></sub>is sometimes unusable whereas <it>N</it><sub><it>xy </it></sub>is still informative.</p>
         <p>Steps (b) and (c) essentially correspond to distance averaging, which requires <it>O</it>(<it>n</it>) run time. These three steps being repeated <it>n </it>- 2 times, agglomerative algorithms require <it>O</it>(<it>n</it><sup>3</sup>) time when using the <it>Q</it><sub><it>xy </it></sub>pair selection criterion, and <it>O</it>(<it>n</it><sup>4</sup>) with <it>N</it><sub><it>xy </it></sub><abbrgrp><abbr bid="B39">39</abbr></abbrgrp>.</p>
         <p>Several refinements of the NJ algorithm have been proposed. BIONJ <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> minimizes the variances associated to the new distances being estimated during each reduction step (c). This way, BIONJ makes use at each iteration of reliable distance estimates to select the new taxon pairs to be agglomerated. To this aim, BIONJ uses a simple Poisson model of the variances and covariances of the distances being contained in the initial distance matrix. BIONJ was generalized into the <it>Minimum Variance Reduction </it>algorithm (MVR <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>), a WLS variant of which can deal with any distance variance model, but which does not account for the distance covariances. It has been shown using computer simulations that this variant (named WLS-MVR in <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> but referred here as MVR for simplicity) has similar accuracy as NJ when applied to distance matrices estimated from one-gene alignments <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. WEIGHBOR <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> further refines BIONJ approach and uses an agglomeration criterion which accounts for the variances of evolutionary distances. All these algorithms require <it>O</it>(<it>n</it><sup>3</sup>) time. Faster, sophisticated distance-based algorithms have been proposed in the last few years <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr></abbrgrp>, some of them being clearly more accurate than NJ and BIONJ (e.g. FASTME <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> and STC <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>, in <it>O</it>(<it>n</it><sup>2 </sup>log(<it>n</it>)) and <it>O</it>(<it>n</it><sup>2</sup>), respectively).</p>
         <p>In this paper, we propose an adaptation of the agglomerative scheme to quickly infer phylogenetic trees from incomplete distance matrices. We show that the <it>Q</it><sub><it>xy </it></sub>criterion may be rewritten to express the mean acentrality of the <it>xy </it>taxon pair. In the same way, the <it>N</it><sub><it>xy </it></sub>criterion may be rewritten to express the mean number of taxon quartets where <it>x </it>and <it>y </it>are neighbors. By estimating these two means using all available (non-missing) distances, we define the two criteria <inline-formula><m:math name="1471-2105-9-166-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>Q</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyuae1aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@3110@</m:annotation></m:semantics></m:math></inline-formula> and <inline-formula><m:math name="1471-2105-9-166-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>N</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOta40aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@310A@</m:annotation></m:semantics></m:math></inline-formula> which allow for the selection of taxon pairs in step (a), even when the distance matrix is incomplete. Using these two new criteria in the agglomerative scheme requires <it>O</it>(<it>n</it><sup>3</sup>) and <it>O</it>(<it>n</it><sup>4</sup>) run time, respectively. A limitation of <inline-formula><m:math name="1471-2105-9-166-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>Q</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyuae1aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@3110@</m:annotation></m:semantics></m:math></inline-formula> and <inline-formula><m:math name="1471-2105-9-166-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>N</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOta40aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@310A@</m:annotation></m:semantics></m:math></inline-formula> is that they cannot be computed when the distance corresponding to the <it>xy </it>pair is missing (see Methods for more). However, this difficulty is inherent to the problem of building trees from incomplete distance matrices and is encountered (in various forms) by all methods to deal with this problem. Moreover, <inline-formula><m:math name="1471-2105-9-166-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>N</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOta40aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@310A@</m:annotation></m:semantics></m:math></inline-formula> partly circumvents this difficulty thanks to its ability to indicate several relevant pairs, rather than a single one with <inline-formula><m:math name="1471-2105-9-166-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>Q</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyuae1aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@3110@</m:annotation></m:semantics></m:math></inline-formula> (see Methods for more). As running <inline-formula><m:math name="1471-2105-9-166-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>N</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOta40aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@310A@</m:annotation></m:semantics></m:math></inline-formula> requires <it>O</it>(<it>n</it><sup>4</sup>) time, we use a filtering technique: at each step (a) we use <inline-formula><m:math name="1471-2105-9-166-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>Q</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyuae1aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@3110@</m:annotation></m:semantics></m:math></inline-formula> to select the <it>s </it>most promising pairs for agglomeration, and then use <inline-formula><m:math name="1471-2105-9-166-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>N</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOta40aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@310A@</m:annotation></m:semantics></m:math></inline-formula> to select the best of these <it>s </it>pairs. This computational trick (and other refinements, see Methods) greatly improves the accuracy compared to using <inline-formula><m:math name="1471-2105-9-166-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>Q</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyuae1aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@3110@</m:annotation></m:semantics></m:math></inline-formula> only, while requiring <it>O</it>(<it>sn</it><sup>3</sup>) time, where <it>s </it>is a small constant (<it>s </it>= 15 in our experiments). Finally, the original NJ, BIONJ and MVR formulae corresponding to steps (b) and (c) essentially are distance averaging and are easily adapted to incomplete matrices. The three new algorithms are named NJ*, BIONJ* and MVR*, respectively.</p>
      </sec>
      <sec>
         <st>
            <p>Results and Discussion</p>
         </st>
         <p>Several computer simulations are presented in this section to assess the performance of NJ*, BIONJ* and MVR*. We first compare the agglomeration criteria <inline-formula><m:math name="1471-2105-9-166-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>Q</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyuae1aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@3110@</m:annotation></m:semantics></m:math></inline-formula>, <inline-formula><m:math name="1471-2105-9-166-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>N</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOta40aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@310A@</m:annotation></m:semantics></m:math></inline-formula> and their combination with distance matrices that are additive, but contain missing entries. Then, using more realistic datasets, we compare NJ*, BIONJ*, MVR* to FITCH <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> and MW* <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, in terms of both topological accuracy and run times.</p>
         <sec>
            <st>
               <p>Comparison of agglomeration criteria</p>
            </st>
            <p>Our approach is similar to Makarenkov and Lapointe's <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. We analyze with various algorithms and criteria a distance matrix with randomly deleted entries. The distance matrix we use is additive, i.e. is obtained from a tree by computing the path length distance between every taxon pair. Let <it>T </it>denote this tree and (<it>T</it><sub><it>ij</it></sub>) be the corresponding distance matrix, where <it>T</it><sub><it>ij </it></sub>is the path-length (or patristic) distance between taxa <it>i </it>and <it>j </it>in <it>T</it>. When no entry is missing, such an additive matrix uniquely defines <it>T</it>, which is recovered by any consistent algorithms (as are all algorithms being tested here). When entries are missing in (<it>T</it><sub><it>ij</it></sub>), recovering <it>T </it>becomes a difficult task (see above), and we measure how well the algorithms perform when given an increasing number of missing distances. Such data thus are not realistic from a biological stand point, as evolutionary distances estimated from sequences are not additive, but this is a simple and standard approach to compare algorithms and agglomeration criteria.</p>
            <p>We use for the correct tree <it>T </it>the phylogeny of 75 placental mammals from <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. The percentage of missing entries is <it>P</it><sub>miss </sub>= 1%, 5%, 10%, 20%, 30%. For each <it>P</it><sub>miss </sub>value, 500 replicates are randomly generated. From each of these 5 &#215; 500 incomplete additive distance matrices, a tree <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-9-166-i3"><m:semantics><m:mover accent="true"><m:mi>T</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafmivaqLbaKaaaaa@2D16@</m:annotation></m:semantics></m:math></inline-formula> is inferred by FITCH, MW* and BIONJ*. Various values of the <it>s </it>parameter are tested for BIONJ*, in order to compare the topological accuracy of <inline-formula><m:math name="1471-2105-9-166-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>Q</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyuae1aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@3110@</m:annotation></m:semantics></m:math></inline-formula>, <inline-formula><m:math name="1471-2105-9-166-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>N</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOta40aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@310A@</m:annotation></m:semantics></m:math></inline-formula>, and of the combination of these two agglomeration criteria. With <it>s </it>= 1, BIONJ* uses <inline-formula><m:math name="1471-2105-9-166-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>Q</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyuae1aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@3110@</m:annotation></m:semantics></m:math></inline-formula> only. With, <it>s </it>> 1, the taxon pairs corresponding to the <it>s </it>highest values of <inline-formula><m:math name="1471-2105-9-166-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>Q</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyuae1aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@3110@</m:annotation></m:semantics></m:math></inline-formula> are reanalyzed with <inline-formula><m:math name="1471-2105-9-166-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>N</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOta40aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@310A@</m:annotation></m:semantics></m:math></inline-formula> (and with other criteria when ties occur; see Methods). When <it>s </it>becomes large (which is denoted as <it>s </it>= max) BIONJ* uses <inline-formula><m:math name="1471-2105-9-166-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>N</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOta40aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@310A@</m:annotation></m:semantics></m:math></inline-formula> only, as all taxon pairs are retained in the first selection step.</p>
            <p>Each inferred tree <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-9-166-i3"><m:semantics><m:mover accent="true"><m:mi>T</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafmivaqLbaKaaaaa@2D16@</m:annotation></m:semantics></m:math></inline-formula> is compared to the correct tree <it>T </it>by using the quartet distance <it>d</it><sub><it>q </it></sub><abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. This topological distance measures the number of resolved 4-taxon subtrees which are induced by one tree but not the other, and thus is more precise than the widely used bipartition distance <abbrgrp><abbr bid="B48">48</abbr></abbrgrp> which counts the number of internal branches present in one tree but not in the other. Moreover, the quartet distance is less affected than the bipartition distance by small topological errors, e.g. wrong position of a single taxon <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. This distance is normalized: <it>d</it><sub><it>q </it></sub>= 0 indicates that <it>T </it>and <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-9-166-i3"><m:semantics><m:mover accent="true"><m:mi>T</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafmivaqLbaKaaaaa@2D16@</m:annotation></m:semantics></m:math></inline-formula> are identical, whereas <it>d</it><sub><it>q </it></sub>= 1 means that both trees do not share any resolved 4-taxon subtrees. Averages of the 500 <it>d</it><sub><it>q </it></sub>measures for each <it>P</it><sub>miss </sub>value are displayed in Figure <figr fid="F1">1</figr>, for FITCH, MW*, and BIONJ* with various <it>s </it>values.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Topological accuracy depending on the rate of missing entries</p>
               </caption>
               <text>
                  <p><b>Topological accuracy depending on the rate of missing entries</b>. Horizontal axis: percentage of missing distances (<it>P</it><sub>miss</sub>. Vertical axis: topological accuracy measured by the mean (over 500 trials) quartet distance (<it>d</it><sub><it>q</it></sub>) between the correct and inferred trees. <it>s</it>: number of taxon pairs that BIONJ* first selects using NJ-like <inline-formula><m:math name="1471-2105-9-166-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>Q</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyuae1aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@3110@</m:annotation></m:semantics></m:math></inline-formula> criterion (6), and then analyzes using score-based <inline-formula><m:math name="1471-2105-9-166-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>N</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOta40aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@310A@</m:annotation></m:semantics></m:math></inline-formula> criterion (9) (and criteria (8), (10), (11) in case of ties). The distance matrix is additive, and thus all methods recover the correct tree when <it>P</it><sub>miss </sub>= 0.</p>
               </text>
               <graphic file="1471-2105-9-166-1"/>
            </fig>
            <p>All curves in Figure <figr fid="F1">1</figr> are decreasing; as expected, the correct tree <it>T </it>is better recovered (i.e. the mean <it>d</it><sub><it>q </it></sub>value between <inline-formula><m:math xmlns:m="http://www.w3.org/1998/Math/MathML" name="1471-2105-9-166-i3"><m:semantics><m:mover accent="true"><m:mi>T</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGafmivaqLbaKaaaaa@2D16@</m:annotation></m:semantics></m:math></inline-formula> and <it>T </it>decreases) as the proportion of missing distance <it>P</it><sub><it>miss </it></sub>becomes closer to 0. Using <inline-formula><m:math name="1471-2105-9-166-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>N</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOta40aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@310A@</m:annotation></m:semantics></m:math></inline-formula> in BIONJ* greatly improves the agglomeration step; e.g. with <it>P</it><sub>miss </sub>= 10%, mean <it>d</it><sub><it>q </it></sub>values of BIONJ* are ~0.0015 and ~0.0008, with <it>s </it>= 1 and <it>s </it>= 15, respectively. However, there is no significant difference between <it>s </it>= 15 and <it>s </it>= max (as assessed by a sign-test <abbrgrp><abbr bid="B50">50</abbr></abbrgrp> based on the 500 replicates, all <it>p</it>-values are much larger than 0.05), meaning that a small value of <it>s </it>(e.g. <it>s </it>= 15) seems to be enough to focus on the most relevant pairs, while avoiding the computational burden of using <inline-formula><m:math name="1471-2105-9-166-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>N</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOta40aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@310A@</m:annotation></m:semantics></m:math></inline-formula> only. Further experiments (see below) confirm this finding. FITCH and BIONJ* (with <it>s </it>= 15 and <it>s </it>= max) have similar accuracy, while MW* tends to perform better than the other algorithms with these data. However, we shall see that algorithm ordering is different with more realistic simulations. These experiments thus confirm the advantage of combining <inline-formula><m:math name="1471-2105-9-166-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>Q</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemyuae1aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@3110@</m:annotation></m:semantics></m:math></inline-formula> and <inline-formula><m:math name="1471-2105-9-166-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>N</m:mi><m:mrow><m:mi>x</m:mi><m:mi>y</m:mi></m:mrow><m:mo>&#8727;</m:mo></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOta40aa0baaSqaaiabdIha4jabdMha5bqaaiabgEHiQaaaaaa@310A@</m:annotation></m:semantics></m:math></inline-formula> within BIONJ*, and similar results (not shown) are obtained with NJ* and MVR*.</p>
         </sec>
         <sec>
            <st>
               <p>Comparison of reconstruction algorithms with distance supermatrices</p>
            </st>
            <p>We re-use a simulation protocol that we have used previously to compare a number of tree-reconstruction methods in a phylogenomic context <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. This protocol involves generating sequences and evolving them along trees, and is more realistic than the comparison described above. We first summarize this protocol, and then report the results that are obtained with the simulated datasets by FITCH, MW*, NJ*, BIONJ* and MVR*. To estimate the distance supermatrix that is the input of these algorithms, we use the SDM method (<abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, see also Methods) which computes a supermatrix that summarizes the topological signal being contained in a collection <inline-formula><m:math name="1471-2105-9-166-i4" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mrow><m:mo>{</m:mo><m:mrow><m:mrow><m:mo>(</m:mo><m:mrow><m:msubsup><m:mi>&#916;</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mn>1</m:mn></m:msubsup></m:mrow><m:mo>)</m:mo></m:mrow><m:mo>,</m:mo><m:mrow><m:mo>(</m:mo><m:mrow><m:msubsup><m:mi>&#916;</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mn>2</m:mn></m:msubsup></m:mrow><m:mo>)</m:mo></m:mrow><m:mo>,</m:mo><m:mn>...</m:mn><m:mo>,</m:mo><m:mrow><m:mo>(</m:mo><m:mrow><m:msubsup><m:mi>&#916;</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mi>k</m:mi></m:msubsup></m:mrow><m:mo>)</m:mo></m:mrow></m:mrow><m:mo>}</m:mo></m:mrow></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaWaaiWaaeaadaqadaqaaiabfs5aenaaDaaaleaacqWGPbqAcqWGQbGAaeaacqaIXaqmaaaakiaawIcacaGLPaaacqGGSaaldaqadaqaaiabfs5aenaaDaaaleaacqWGPbqAcqWGQbGAaeaacqaIYaGmaaaakiaawIcacaGLPaaacqGGSaalcqGGUaGlcqGGUaGlcqGGUaGlcqGGSaaldaqadaqaaiabfs5aenaaDaaaleaacqWGPbqAcqWGQbGAaeaacqWGRbWAaaaakiaawIcacaGLPaaaaiaawUhacaGL9baaaaa@482D@</m:annotation></m:semantics></m:math></inline-formula> of <it>k </it>distance matrices. Simulations <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> have shown the high-quality of this distance supermatrix in both medium- and high-level gene combinations.</p>
            <p>Simulations are as follows (see <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> for more details). Starting from a randomly generated tree <it>T </it>with <it>n </it>= 48 taxa, evolution of <it>k </it>genes is simulated, with <it>k </it>= 2, 4, ..., 20. For each of the <it>k </it>genes, some taxa are randomly deleted. Two deletion probabilities are used: 25% to preserve high overlap between the different taxon sets, and 75% to induce low overlap. From these <it>k </it>partially deleted gene alignments, <it>k </it>distance matrices are estimated to compose the collection <it>C</it><sub>&#916; </sub>of source matrices. The SDM method is then run with <it>C</it><sub>&#916; </sub>to obtain a distance supermatrix corresponding to a medium-level combination of the <it>k </it>partially deleted genes. To study the high-level combination, a phylogenetic tree is inferred by PhyML <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> from each of the <it>k </it>partially deleted genes; then, the path length distance between each taxon pair for each of the <it>k </it>phylogenies is computed, to form the collection <it>C</it><sub><it>T </it></sub>of <it>k </it>additive distance matrices that are equivalent to the <it>k </it>PhyML trees. Finally, SDM is applied to <it>C</it><sub><it>T </it></sub>to obtain a distance supermatrix corresponding to a high-level gene combination.</p>
            <p>This simulation protocol is repeated 500 times for each value of <it>k </it>and each deletion proportion. We obtain this way (10 gene collection sizes &#215; 500 collections &#215; 2 overlap conditions &#215; 2 gene combination levels) = 20,000 distance supermatrices, which are denoted as (<inline-formula><m:math name="1471-2105-9-166-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#916;</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mrow><m:mtext>SDM</m:mtext></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeuiLdq0aa0baaSqaaiabdMgaPjabdQgaQbqaaiabbofatjabbseaejabb2eanbaaaaa@337D@</m:annotation></m:semantics></m:math></inline-formula>) and are frequently incomplete. Indeed, if taxon <it>i </it>is missing for gene <it>p</it>, then <inline-formula><m:math name="1471-2105-9-166-i6" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#916;</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mi>p</m:mi></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeuiLdq0aa0baaSqaaiabdMgaPjabdQgaQbqaaiabdchaWbaaaaa@3189@</m:annotation></m:semantics></m:math></inline-formula> is missing &#8211; which is denoted as <inline-formula><m:math name="1471-2105-9-166-i6" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#916;</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mi>p</m:mi></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeuiLdq0aa0baaSqaaiabdMgaPjabdQgaQbqaaiabdchaWbaaaaa@3189@</m:annotation></m:semantics></m:math></inline-formula> = &#8709;&#8212;, and if <inline-formula><m:math name="1471-2105-9-166-i6" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#916;</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mi>p</m:mi></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeuiLdq0aa0baaSqaaiabdMgaPjabdQgaQbqaaiabdchaWbaaaaa@3189@</m:annotation></m:semantics></m:math></inline-formula> = &#8709; for all <it>p </it>= 1,2, ..., <it>k</it>, then <inline-formula><m:math name="1471-2105-9-166-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#916;</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mrow><m:mtext>SDM</m:mtext></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeuiLdq0aa0baaSqaaiabdMgaPjabdQgaQbqaaiabbofatjabbseaejabb2eanbaaaaa@337D@</m:annotation></m:semantics></m:math></inline-formula> = &#8709;. With 25% deletion rate, almost all distance supermatrices are complete when <it>k </it>&#8805; 14. With 75% deletion rate, all distance supermatrices are incomplete, but the number of missing distances decreases as <it>k </it>increases (missing distance proportions range from 42% to 11%).</p>
            <p>FITCH and MW* are run with default options. In accordance with Figure <figr fid="F1">1</figr>, <it>s </it>is set to 15 for NJ*, BIONJ* and MVR*. With BIONJ*, <it>V</it><sub><it>ij </it></sub>variances (associated with <inline-formula><m:math name="1471-2105-9-166-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#916;</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mrow><m:mtext>SDM</m:mtext></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeuiLdq0aa0baaSqaaiabdMgaPjabdQgaQbqaaiabbofatjabbseaejabb2eanbaaaaa@337D@</m:annotation></m:semantics></m:math></inline-formula> distance estimates) are naturally defined by <it>V</it><sub><it>ij </it></sub>&#8733; <inline-formula><m:math name="1471-2105-9-166-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#916;</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mrow><m:mtext>SDM</m:mtext></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeuiLdq0aa0baaSqaaiabdMgaPjabdQgaQbqaaiabbofatjabbseaejabb2eanbaaaaa@337D@</m:annotation></m:semantics></m:math></inline-formula> if <inline-formula><m:math name="1471-2105-9-166-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#916;</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mrow><m:mtext>SDM</m:mtext></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeuiLdq0aa0baaSqaaiabdMgaPjabdQgaQbqaaiabbofatjabbseaejabb2eanbaaaaa@337D@</m:annotation></m:semantics></m:math></inline-formula> &#8800; &#8709;, else <it>V</it><sub><it>ij </it></sub>= &#8709;. Variances used by MVR* comply with the same rule, but account for other parameters such as the length and the number of sequences being used to estimate each <inline-formula><m:math name="1471-2105-9-166-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#916;</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mrow><m:mtext>SDM</m:mtext></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeuiLdq0aa0baaSqaaiabdMgaPjabdQgaQbqaaiabbofatjabbseaejabb2eanbaaaaa@337D@</m:annotation></m:semantics></m:math></inline-formula> distance (see Methods). Accuracy of the five algorithms is measured with the topological distance <it>d</it><sub><it>q</it></sub>, as above, and averaged for the 500 replicates corresponding to each of the conditions. Results are reported in Table <tblr tid="T1">1</tblr> for the medium-level gene combination, and in Table <tblr tid="T2">2</tblr> for the high-level gene combination. For each value of <it>k</it>, the first- and second-best mean <it>d</it><sub><it>q </it></sub>values are indicated in bold&amp;underlined and bold, respectively, and a sign-test <abbrgrp><abbr bid="B50">50</abbr></abbrgrp> based on the 500 replicates is used to assess the significance of the difference between these two best values.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Topological accuracy with medium-level distance supermatrices</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c cspan="7" ca="center">
                        <p>
                           <b>(a): 25% taxon deletion rate</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><b><it>k </it></b>=</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>FITCH</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MW*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>NJ*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>BIONJ*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MVR*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b><it>p</it>-value</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0841</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0906</p>
                     </c>
                     <c ca="center">
                        <p>0.0926</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0841</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0857</p>
                     </c>
                     <c ca="center">
                        <p>0.286</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0504</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0546</p>
                     </c>
                     <c ca="center">
                        <p>0.0595</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0494</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0524</p>
                     </c>
                     <c ca="center">
                        <p>0.466</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0400</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0445</p>
                     </c>
                     <c ca="center">
                        <p>0.0454</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0370</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0410</p>
                     </c>
                     <c ca="center">
                        <p>0.015</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>0.0330</p>
                     </c>
                     <c ca="center">
                        <p>0.0356</p>
                     </c>
                     <c ca="center">
                        <p>0.0386</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0318</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0320</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.958</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0271</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0300</p>
                     </c>
                     <c ca="center">
                        <p>0.0317</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0265</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0286</p>
                     </c>
                     <c ca="center">
                        <p>0.364</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0294</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0317</p>
                     </c>
                     <c ca="center">
                        <p>0.0354</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0284</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0314</p>
                     </c>
                     <c ca="center">
                        <p>0.030</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0245</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0266</p>
                     </c>
                     <c ca="center">
                        <p>0.0286</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0235</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0251</p>
                     </c>
                     <c ca="center">
                        <p>0.816</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>16</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0290</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0318</p>
                     </c>
                     <c ca="center">
                        <p>0.0327</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0282</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0303</p>
                     </c>
                     <c ca="center">
                        <p>0.028</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>18</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0252</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0278</p>
                     </c>
                     <c ca="center">
                        <p>0.0280</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0234</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0265</p>
                     </c>
                     <c ca="center">
                        <p>0.020</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0242</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0259</p>
                     </c>
                     <c ca="center">
                        <p>0.0281</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0230</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0247</p>
                     </c>
                     <c ca="center">
                        <p>0.955</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="7" ca="center">
                        <p>
                           <b>(b): 75% taxon deletion rate</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><b><it>k </it></b>=</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>FITCH</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MW*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>NJ*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>BIONJ*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MVR*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b><it>p</it>-value</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.2154</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.2174</p>
                     </c>
                     <c ca="center">
                        <p>0.2187</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.2131</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.2163</p>
                     </c>
                     <c ca="center">
                        <p>0.920</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.1683</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.1778</p>
                     </c>
                     <c ca="center">
                        <p>0.1818</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.1713</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.1713</p>
                     </c>
                     <c ca="center">
                        <p>0.060</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.1347</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.1443</p>
                     </c>
                     <c ca="center">
                        <p>0.1534</p>
                     </c>
                     <c ca="center">
                        <p>0.1418</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.1400</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8776; 0.0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.1089</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.1253</p>
                     </c>
                     <c ca="center">
                        <p>0.1302</p>
                     </c>
                     <c ca="center">
                        <p>0.1137</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.1114</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.176</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0878</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.1039</p>
                     </c>
                     <c ca="center">
                        <p>0.1117</p>
                     </c>
                     <c ca="center">
                        <p>0.0959</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0901</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.033</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0825</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0968</p>
                     </c>
                     <c ca="center">
                        <p>0.1021</p>
                     </c>
                     <c ca="center">
                        <p>0.0875</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0842</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.470</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0652</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0749</p>
                     </c>
                     <c ca="center">
                        <p>0.0850</p>
                     </c>
                     <c ca="center">
                        <p>0.0710</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0676</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.464</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>16</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0583</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0731</p>
                     </c>
                     <c ca="center">
                        <p>0.0802</p>
                     </c>
                     <c ca="center">
                        <p>0.0658</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0625</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.335</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>18</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0516</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0617</p>
                     </c>
                     <c ca="center">
                        <p>0.0687</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0552</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0555</p>
                     </c>
                     <c ca="center">
                        <p>0.074</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0503</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0600</p>
                     </c>
                     <c ca="center">
                        <p>0.0682</p>
                     </c>
                     <c ca="center">
                        <p>0.0560</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0509</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.189</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>In the medium-level combination, distance matrices are directly estimated from each of the genes and then combined (using SDM) into the distance supermatrix. Topolological accuracy is mesured by the mean (over 500 trials) quartet distance (<it>d</it><sub><it>q</it></sub>) between the correct and inferred trees. <it>k</it>: number of genes. <it>p</it>-value: sign-test significance when comparing the 500 <it>d</it><sub><it>q </it></sub>values of the two best methods that are indicated in bold and underlined (1<sup>st </sup>method) and bold (2<sup>nd </sup>one)</p>
               </tblfn>
            </tbl>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Topological accuracy with high-level distance supermatrices</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c cspan="7" ca="center">
                        <p>
                           <b>(a): 25% taxon deletion rate</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><b><it>k </it></b>=</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>FITCH</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MW*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>NJ*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>BIONJ*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MVR*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b><it>p</it>-value</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0558</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0561</p>
                     </c>
                     <c ca="center">
                        <p>0.0586</p>
                     </c>
                     <c ca="center">
                        <p>0.0566</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0522</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8776; 0.0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0337</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0345</p>
                     </c>
                     <c ca="center">
                        <p>0.0361</p>
                     </c>
                     <c ca="center">
                        <p>0.0351</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0319</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8776; 0.0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0253</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0265</p>
                     </c>
                     <c ca="center">
                        <p>0.0272</p>
                     </c>
                     <c ca="center">
                        <p>0.0261</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0235</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8776; 0.0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>0.0227</p>
                     </c>
                     <c ca="center">
                        <p>0.0228</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0213</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0217</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0212</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.094</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0187</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0188</p>
                     </c>
                     <c ca="center">
                        <p>0.0194</p>
                     </c>
                     <c ca="center">
                        <p>0.0192</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0171</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.047</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0197</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0207</p>
                     </c>
                     <c ca="center">
                        <p>0.0215</p>
                     </c>
                     <c ca="center">
                        <p>0.0199</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0191</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.949</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0160</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0164</p>
                     </c>
                     <c ca="center">
                        <p>0.0164</p>
                     </c>
                     <c ca="center">
                        <p>0.0165</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0162</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.882</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>16</p>
                     </c>
                     <c ca="center">
                        <p>0.0208</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0204</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0210</p>
                     </c>
                     <c ca="center">
                        <p>0.0213</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0206</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8776; 0.0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>18</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0170</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0177</p>
                     </c>
                     <c ca="center">
                        <p>0.0177</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0173</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0174</p>
                     </c>
                     <c ca="center">
                        <p>0.271</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="center">
                        <p>0.0162</p>
                     </c>
                     <c ca="center">
                        <p>0.0168</p>
                     </c>
                     <c ca="center">
                        <p>0.0171</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0160</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0158</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.648</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="7" ca="center">
                        <p>
                           <b>(b): 75% taxon deletion rate</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><b><it>k </it></b>=</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>FITCH</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MW*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>NJ*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>BIONJ*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MVR*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b><it>p</it>-value</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.1876</p>
                     </c>
                     <c ca="center">
                        <p>0.1877</p>
                     </c>
                     <c ca="center">
                        <p>0.1824</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.1822</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.1817</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.282</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>0.1396</p>
                     </c>
                     <c ca="center">
                        <p>0.1397</p>
                     </c>
                     <c ca="center">
                        <p>0.1390</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.1381</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.1345</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.018</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.1095</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.1125</p>
                     </c>
                     <c ca="center">
                        <p>0.1134</p>
                     </c>
                     <c ca="center">
                        <p>0.1119</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.1065</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.166</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0865</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0892</p>
                     </c>
                     <c ca="center">
                        <p>0.0926</p>
                     </c>
                     <c ca="center">
                        <p>0.0870</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0823</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.005</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0690</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0739</p>
                     </c>
                     <c ca="center">
                        <p>0.0766</p>
                     </c>
                     <c ca="center">
                        <p>0.0723</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0671</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.023</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0641</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0670</p>
                     </c>
                     <c ca="center">
                        <p>0.0705</p>
                     </c>
                     <c ca="center">
                        <p>0.0677</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0616</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.015</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0508</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0538</p>
                     </c>
                     <c ca="center">
                        <p>0.0567</p>
                     </c>
                     <c ca="center">
                        <p>0.0534</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0493</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8776; 0.0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>16</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0504</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0518</p>
                     </c>
                     <c ca="center">
                        <p>0.0554</p>
                     </c>
                     <c ca="center">
                        <p>0.0512</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0457</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8776; 0.0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>18</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0409</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0416</p>
                     </c>
                     <c ca="center">
                        <p>0.0485</p>
                     </c>
                     <c ca="center">
                        <p>0.0424</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0402</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.922</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0403</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.0435</p>
                     </c>
                     <c ca="center">
                        <p>0.0453</p>
                     </c>
                     <c ca="center">
                        <p>0.0431</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0371</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8776; 0.0</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>In the high-level combination, ML trees are first inferred separately for every genes, and then these trees are turned into path-length distance matrices which are combined (using SDM) into the distance supermatrix. For symbols and notation, see note to Table 1.</p>
               </tblfn>
            </tbl>
            <p>In the medium-level gene combination, NJ* and MW* are outperformed by other algorithms. With a 25% deletion rate, BIONJ* has best topological accuracy, followed by FITCH. However, the sign-test indicates that the difference between these two algorithms is moderately significant as the <it>p</it>-value is lower than 0.05 for only five <it>k </it>values (= 6, 8, 12, 16, and 18). With a 75% deletion rate, FITCH is best, but again the sign-test shows that FITCH, BIONJ* and MVR* are broadly equivalent.</p>
            <p>With high-level combination distance supermatrices, NJ* and MW* still tend to be outperformed by other algorithms. BIONJ* is in between, and the best mean <it>d</it><sub><it>q</it></sub> values are observed with MVR* which is followed by FITCH. The sign-test broadly confirms the significance of this observation, though the accuracy difference between MVR* and FITCH is relatively low.</p>
            <p>Altogether, these experiments show that MVR* is at least as accurate as FITCH, that BIONJ* has similar performance, while NJ* and MW* are behind these three algorithms. Comparing these findings with the results from (see Figure 2 in <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>), we see that (in the high-level framework, Table <tblr tid="T2">2</tblr>) MVR* is more accurate than the standard Matrix Representation with Parsimony method (MRP, <abbrgrp><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr></abbrgrp>), in most cases; e.g. with <it>k </it>= 10, MVR* has mean <it>d</it><sub><it>q </it></sub>values of 0.0171 and 0.0663, for 25% and 75% deletion rate, respectively, while mean <it>d</it><sub><it>q </it></sub>values of MRP equal 0.0175 and 0.1152. MVR* (combined with SDM) outperforms MRP with sparse information (75% deletion rate and/or low number of genes), while both approaches are nearly equivalent when the information is abundant (25% deletion rate). An explanation <abbrgrp><abbr bid="B53">53</abbr></abbrgrp> of this finding could be that the distance approach not only uses the topology of the source trees (as MRP) but also their branch lengths. Distance-based supertrees thus contain more information than MRP supertrees, which makes a noticeable difference when the information is sparse, but does not impact much the results with abundant information (see also following simulation results).</p>
         </sec>
         <sec>
            <st>
               <p>Results with simulations based on Driskell et al. <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> dataset</p>
            </st>
            <p>This section aims to measure the accuracy of the different tree building algorithms when applied to simulated datasets being more realistic than those commonly used in a phylogenomic perspective. Most notably, uniformly random gene deletion (used in previous section, following <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>) is not fully realistic because some genes (e.g. cytochrome b) are sequenced for most species, while some other genes are rarely sequenced (or rare among living species). It follows that the gene presence/absence pattern is different with real datasets to this being induced by uniformly random gene deletion (see <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr></abbrgrp> for illustrative examples). To this purpose, we use the character supermatrix from Driskell et al. <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, which comprises 69 green plant species and 254 genes, and was built via an automated exploration process of GenBank. This matrix contains a total number of 2777 sequences and has 87% missing characters, which are unequally distributed among taxa. Only 3 taxa have more than 50% genes, whereas 42 have 10% genes or less. In the same way, a few genes are present in most taxa (e.g., the 2 most sequenced genes belong to 59 taxa), whereas other genes are rare (e.g. 121 genes are present in at most 5 taxa). However, these <it>k </it>= 254 genes are complementary and the SDM distance supermatrix only contains ~1.2% missing entries. This low proportion of missing entries is favorable to tree reconstruction, but still requires an algorithm able to deal with incomplete matrices.</p>
            <p>We use a simulation protocol analogous to that described above <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. The only difference is the deletion procedure, with random deletion replaced by the gene presence/absence pattern of (see Figure 2B in <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>). We generate 100 datasets this way with <it>n </it>= 69 taxa and <it>k </it>= 254 genes. From these 100 datasets, we infer 100 distance matrix collections <it>C</it><sub>&#916; </sub>and 100 tree collections <it>C</it><sub><it>T</it></sub>. Each of these 2 &#215; 100 collections is dealt with by SDM, to obtain a distance supermatrix (<inline-formula><m:math name="1471-2105-9-166-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#916;</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mrow><m:mtext>SDM</m:mtext></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeuiLdq0aa0baaSqaaiabdMgaPjabdQgaQbqaaiabbofatjabbseaejabb2eanbaaaaa@337D@</m:annotation></m:semantics></m:math></inline-formula>) that contains the same missing entries as those induced by the original dataset <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. We use these matrices to compare FITCH, MW*, NJ*, BIONJ* and MVR*, based on <it>d</it><sub><it>q </it></sub>quartet distance between the correct and inferred trees (see above). Our three algorithms are run with both <it>s </it>= 15 and <it>s </it>= max. Results of MRP are also computed, using TNT <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> to infer the most parsimonious trees. TNT is run with 25 random addition sequences, TBR branch swapping and ratchet. The MRP supertree is defined in the standard way <abbrgrp><abbr bid="B59">59</abbr></abbrgrp> as the strict consensus of the most parsimonious trees. Results are displayed in Table <tblr tid="T3">3</tblr>, which is similar to Tables <tblr tid="T1">1</tblr> and <tblr tid="T2">2</tblr>; the first- and second-best mean <it>d<sub>q</sub></it> values are indicated in bold&amp;underlined and bold, respectively, and sign-tests are used to assess the significance of the differences between MVR* (our best algorithm), FITCH and MRP.</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Topological accuracy with datasets generated from Driskell et al. [20]</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c cspan="8" ca="center">
                        <p>
                           <b>(a): medium level</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>FITCH</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MW*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>NJ*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>BIONJ*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MVR*</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="center">
                        <p>
                           <b><it>p</it>-value</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MVR* &#8211; FITCH</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MVR* &#8211; BIONJ*</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0.0234</p>
                     </c>
                     <c ca="center">
                        <p>0.0268</p>
                     </c>
                     <c ca="center">
                        <p>0.0289</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0227</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0171</ul>
                           </b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>&#8776; 0.0</p>
                     </c>
                     <c ca="center">
                        <p>&#8776; 0.0</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="8" ca="center">
                        <p>
                           <b>(b): high level</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>FITCH</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MW*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>NJ*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>BIONJ*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MVR*</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MRP</b>
                        </p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>
                           <b><it>p</it>-value</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MVR* &#8211; FITCH</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MVR* &#8211; MRP</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>0.0161</p>
                     </c>
                     <c ca="center">
                        <p>0.0165</p>
                     </c>
                     <c ca="center">
                        <p>0.0182</p>
                     </c>
                     <c ca="center">
                        <p>0.0172</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <ul>0.0101</ul>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>0.0119</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.001</p>
                     </c>
                     <c ca="center">
                        <p>0.193</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>(a): Medium-level combination of the distance matrices being directly estimated from the gene sequences. (b): High-level combination; ML trees are first inferred separately for every genes; MRP turns the gene trees into a matrix of partial binary characters, which is analyzed with parsimony; with the other (distance) methods, the gene trees are turned into path-length distance matrices which are combined into the distance supermatrix. All combinations of source distance matrices are achieved using SDM. <it>p</it>-value: sign-test significance when comparing the 100 <it>d</it><sub><it>q </it></sub>values of MVR* (our best algorithm) to those of FITCH and MRP. For other symbols and notation, see note to Table 1.</p>
               </tblfn>
            </tbl>
            <p>NJ*, BIONJ* and MVR* do not show any significant difference when used with <it>s </it>= 15 and <it>s </it>= max (as assessed by the sign-test, all <it>p</it>-values are much larger than 0.05, results not shown). This confirms the results of the previous experiments to compare our various agglomeration criteria. NJ* has the worst accuracy, especially in the high-level combination framework. MW*, FITCH and BIONJ* show similar performance, while MVR* is best among distance approaches in the two gene combination levels. Moreover, the difference between MVR* and FITCH is highly significant (sign-test <it>p</it>-value &#8776; 0.0). In the high-level framework, MVR* tends to be better than MRP, although the information is quite abundant (254 genes, ~1.2% of missing distances); however, the difference is not significant with 100 replicates (sign-test <it>p</it>-value &#8776; 0.2). The results among distance methods are explained by the fact that MVR* uses fairly accurate estimates (<inline-formula><m:math name="1471-2105-9-166-i7" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>V</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mrow><m:mtext>SDM</m:mtext></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOvay1aa0baaSqaaiabdMgaPjabdQgaQbqaaiabbofatjabbseaejabb2eanbaaaaa@334C@</m:annotation></m:semantics></m:math></inline-formula>) of the variances of the distances in (<inline-formula><m:math name="1471-2105-9-166-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#916;</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mrow><m:mtext>SDM</m:mtext></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeuiLdq0aa0baaSqaaiabdMgaPjabdQgaQbqaaiabbofatjabbseaejabb2eanbaaaaa@337D@</m:annotation></m:semantics></m:math></inline-formula>). Indeed, dataset <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> induces a highly heterogeneous distribution of missing sequences, meaning that some distances are well estimated thanks to a large number of sequences, while some others are poorly estimated via a few sequences. This is accounted for by MVR* in (<inline-formula><m:math name="1471-2105-9-166-i7" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>V</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mrow><m:mtext>SDM</m:mtext></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOvay1aa0baaSqaaiabdMgaPjabdQgaQbqaaiabbofatjabbseaejabb2eanbaaaaa@334C@</m:annotation></m:semantics></m:math></inline-formula>) calculations (see Methods), while MW*, FITCH and BIONJ* lack this information and use inaccurate estimations of (<inline-formula><m:math name="1471-2105-9-166-i7" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>V</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mrow><m:mtext>SDM</m:mtext></m:mrow></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOvay1aa0baaSqaaiabdMgaPjabdQgaQbqaaiabbofatjabbseaejabb2eanbaaaaa@334C@</m:annotation></m:semantics></m:math></inline-formula>). The difference between these two approaches (i.e. MVR* on the one hand, and MW*, FITCH and BIONJ* on the other hand) is somewhat hidden when using uniformly random sequence deletion, because with the latter all distances are broadly estimated with the same number of genes. With biologically realistic pattern of gene presence/absence, the difference becomes important, especially for the high-level combination. Thus, this last set of simulations confirms the findings of the previous ones and supports the capacity of MVR* for dealing with phylogenomic data.</p>
         </sec>
         <sec>
            <st>
               <p>Run time comparison</p>
            </st>
            <p>Run times with various dataset sizes have been measured on a PC Pentium IV 1.8 GHz (1 Gb RAM) and are displayed in Table <tblr tid="T4">4</tblr>. We do not report the run times of NJ* and BIONJ*, as they are nearly the same as those of MVR*. In fact, NJ* and BIONJ* are ~2% faster than MVR*, because they are simpler, but these simplifications does not concern the heavy <it>O</it>(<it>n</it><sup>3</sup>) parts of the algorithms (see Methods). We also report the run times of SDM <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, which are in the same range as the fastest tree building algorithms, except with Driskell et al. <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>-like datasets, where SDM has to summarize a large number (254) of source matrices, but where the number of taxa (69) is relatively low. In this case, the run time of SDM is analogous to that of FITCH and MW* and remains quite handy (~5 minutes per dataset).</p>
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Run times</p>
               </caption>
               <tblbdy cols="11">
                  <r>
                     <c cspan="11" ca="center">
                        <p>
                           <b>(a): 25% taxon deletion rate</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="11">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>SDM</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>FITCH</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MW*</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MVR*</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="11">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p><it>K </it>=</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p><it>k </it>=</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p><it>k </it>=</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p><it>k </it>=</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="11">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><it>n </it>= 48</p>
                     </c>
                     <c ca="center">
                        <p>&lt; 1</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                     <c ca="center">
                        <p>23</p>
                     </c>
                     <c ca="center">
                        <p>23</p>
                     </c>
                     <c ca="center">
                        <p>21</p>
                     </c>
                     <c ca="center">
                        <p>39</p>
                     </c>
                     <c ca="center">
                        <p>41</p>
                     </c>
                     <c ca="center">
                        <p>&lt; 1</p>
                     </c>
                     <c ca="center">
                        <p>&lt; 