<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-7-529</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Methodology article</dochead>
      <bibl>
         <title>
            <p>Fast estimation of the difference between two PAM/JTT evolutionary distances in triplets of homologous sequences</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Dessimoz</snm>
               <fnm>Christophe</fnm>
               <insr iid="I1"/>
               <email>cdessimoz@inf.ethz.ch</email>
            </au>
            <au id="A2">
               <snm>Gil</snm>
               <fnm>Manuel</fnm>
               <insr iid="I1"/>
               <email>mgil@inf.ethz.ch</email>
            </au>
            <au id="A3">
               <snm>Schneider</snm>
               <fnm>Adrian</fnm>
               <insr iid="I1"/>
               <email>schneadr@inf.ethz.ch</email>
            </au>
            <au id="A4">
               <snm>Gonnet</snm>
               <mi>H</mi>
               <fnm>Gaston</fnm>
               <insr iid="I1"/>
               <email>gonnet@inf.ethz.ch</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>ETH Zurich, Institute of Computational Science, 8092 Z&#252;rich, Switzerland</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2006</pubdate>
         <volume>7</volume>
         <issue>1</issue>
         <fpage>529</fpage>
         <url>http://www.biomedcentral.com/1471-2105/7/529</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17147817</pubid>
               <pubid idtype="doi">10.1186/1471-2105-7-529</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>14</day>
               <month>8</month>
               <year>2006</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>05</day>
               <month>12</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>05</day>
               <month>12</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Dessimoz et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The estimation of the difference between two evolutionary distances within a triplet of homologs is a common operation that is used for example to determine which of two sequences is closer to a third one. The most accurate method is currently maximum likelihood over the entire triplet. However, this approach is relatively time consuming.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We show that an alternative estimator, based on pairwise estimates and therefore much faster to compute, has almost the same statistical power as the maximum likelihood estimator. We also provide a numerical approximation for its variance, which could otherwise only be estimated through an expensive re-sampling approach such as bootstrapping. An extensive simulation demonstrates that the approximation delivers precise confidence intervals. To illustrate the possible applications of these results, we show how they improve the detection of asymmetric evolution, and the identification of the closest relative to a given sequence in a group of homologs.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The results presented in this paper constitute a basis for large-scale protein cross-comparisons of pairwise evolutionary distances.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The estimation of evolutionary distances between biological sequences is at the basis of many bioinformatics problems: it plays a particularly important role in phylogenetic tree inference <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp> and in an increasing number of comparative genomics analyses over large sets of genes or proteins (e.g. <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>). The most accurate way of estimating evolutionary distances is currently maximum likelihood, but the procedure is so time-consuming that is hardly practical when dealing with large datasets. In such cases, complexity is often tackled by working on the basis of individual pairs, such as in distance tree methods or in the "all-against-all" at the beginning of many comparative genomics analyses. However, by estimating an evolutionary distance for each pair individually, no knowledge about the covariance of distance estimates with common evolution can be directly obtained. Thus, when comparing pairwise distances among related sequences, for instance to infer which of two homologs is closer to a third one, confidence intervals cannot be derived directly from the pairwise estimates.</p>
         <p>The present article investigates this fundamental problem of estimating the difference between two distances in a triplet of homologs (Fig. <figr fid="F1">1</figr>). We compare the standard multivariate maximum likelihood approach with a much faster estimator based on pairwise distances, and present a formula to estimate its variance. As two examples of applications, we show how our results improve the detection of asymmetric evolution and the identification of the closest relative in a group of homologs. But first, we briefly review the Markovian model of evolution and maximum likelihood estimation of distances.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Unrooted tree topology of all triplets of homologs</p>
            </caption>
            <text>
               <p><b>Unrooted tree topology of all triplets of homologs</b>. Sequences <it>X</it>, <it>Y </it>and <it>Z </it>originating from <it>O</it>. The problem addressed here is the estimation of the difference &#916; = <it>d</it><sub><it>XY </it></sub>- <it>d</it><sub><it>XZ </it></sub>= <it>d</it><sub><it>OY </it></sub>- <it>d</it><sub><it>OZ</it></sub></p>
            </text>
            <graphic file="1471-2105-7-529-1"/>
         </fig>
         <sec>
            <st>
               <p>PAM model of sequence evolution</p>
            </st>
            <p>The evolutionary distance between two biological sequences is generally based on the assumption of a first-order Markovian process of amino acid evolution. This implies two biological assumptions, common to all standard models of evolution: no memory and position-independence. The substitutional processes are described in the form of substitution matrices, defining mutation probabilities from each character to every other character for a given evolutionary distance. These matrices are either parametrical models of sequence evolution or empirically based substitution matrices. Parametrical models are often employed for nucleotide substitution (e.g. Jukes-Cantor <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> or Hasegawa-Kishino-Yano <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>), while empirical matrices (based on counted substitutions of large sets of sequence alignments) are widely used for peptide replacements in proteins. Pioneered by Dayhoff in the 1970s <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, these models have been improved with more sequence data becoming available in the 1990s (e.g. the updated Dayhoff matrices by Gonnet-Cohen-Benner <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> or Jones-Taylor-Thornton (JTT) <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>). Codon substitutions have been described by parametrical (e.g. <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>) as well as empirical (e.g. <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>) matrices.</p>
            <p>Because of the additivity of distances computed under the Markovian model of sequence evolution. substitution matrices for a wide range of evolutionary distances can be derived from a single substitution matrix <it>M</it>(<it>d</it><sub>0</sub>) through the equation <it>M</it>(<it>d</it><sub>0</sub>)<sup><it>x </it></sup>= <it>M</it>(<it>xd</it><sub>0</sub>), which is a special form of the Chapman-Kolmogorov equation for Markov chains. It is common and computationally more efficient to formulate this process in terms of a rate matrix <it>Q </it>from which the probability matrices for distance <it>d </it>are derived as <it>M</it>(<it>d</it>) = <it>e</it><sup><it>dQ</it></sup>. We normally measure <it>d </it>in PAM units <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, which completely defines <it>Q</it>.</p>
         </sec>
         <sec>
            <st>
               <p>Maximum likelihood estimation</p>
            </st>
            <p>Evolutionary distances are best estimated by maximum likelihood (ML). In case of a pair of sequences, the ML estimation is well known and practical (see <it>Methods </it>part). When more sequences are under consideration, the complexity of distance estimation by ML increases very steeply, mainly because it requires a multiple sequence alignment (MSA) and the inference of the phylogenetic tree topology, two difficult procedures for which the optimal solution can currently only be computed in exponential time with respect to the number of sequences. A common strategy for tackling this problem is to work on the basis of pairs, such as in distance tree methods. In this article, we focus on the specific problem of estimating, in a triplet of homologs <it>X</it>,<it>Y</it>,<it>Z </it>(Fig. <figr fid="F1">1</figr>). the difference &#916; between two distances <it>d</it><sub><it>XY </it></sub>and <it>d</it><sub><it>XZ</it></sub>. In such case, the multidimensional ML approach over the triplet is still practical. We call the estimator of &#916; obtained by this method <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>triplet</it></sub>. Alternatively, &#916; can be estimated by a simple algebraic relation over pairwise distances over <it>X</it>, <it>Y</it>, <it>Z </it>estimated individually. We call this alternative estimator <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>. Details about the computation of <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>triplet </it></sub>and <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise </it></sub>are provided in the <it>Methods </it>section.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <p>In the present section, we compare the estimators <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>triplet </it></sub>and <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>, and introduce a numerical approximation to estimate the variance of <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>, and show that it gives accurate confidence intervals. Finally, we describe two applications of the results.</p>
         <sec>
            <st>
               <p>Comparison between the two estimators</p>
            </st>
            <p>In terms of computational complexity, the two estimators differ significantly. Given <it>m </it>sequences of length <it>n</it>, <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>triplet </it></sub>requires the separate treatment of each <it>O</it>(<it>m</it><sup>3</sup>) triplet, and considering that an optimal 3-way alignment by dynamic programming (DP) is <it>O</it>(<it>n</it><sup>3</sup>), the time complexity is <it>O</it>(<it>m</it><sup>3</sup><it>n</it><sup>3</sup>). In contrast, all <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise </it></sub>can be computed on the basis of <it>O</it>(<it>m</it><sup>2</sup>) pairs of sequences aligned by DP in <it>O</it>(<it>n</it><sup>2</sup>), yielding a time complexity of <it>O</it>(<it>n</it><sup>2</sup><it>m</it><sup>2</sup>). Typically, whenever an analysis involves more than a few thousand proteins, millions of triplets have to be considered and <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise </it></sub>is the only practical approach of the two. In terms of accuracy, both estimators are asymptotically unbiased: in the case of <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>triplet</it></sub>, it is a property of the ML estimator, while in the case of <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>, it is the consequence of the linearity of the expected value (see <it>Methods</it>). We compared the two estimators by simulation over a large number of triplets (length: 300 AA), generated randomly according to the PAM model of evolution with different distances <it>d</it><sub><it>OX</it></sub>, <it>d</it><sub><it>OY</it></sub>, <it>d</it><sub><it>OZ </it></sub>(Fig. <figr fid="F1">1</figr>). In each experiment, both estimators were converging toward the correct value for the difference, which confirms that the asymptotic behavior is a reasonable assumption for protein sequences of typical length. In terms of statistical power; surprisingly, the observed variance of the estimates obtained by <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise </it></sub>was on average less than 1% larger than the observed variance of the ML estimator over the triplet, suggesting that <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>, although much faster to compute, is on average almost as accurate as <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>triplet</it></sub>.</p>
            <p>The variance of <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>triplet </it></sub>can be computed exactly (see <it>Methods </it>section). But there is no direct estimator of the variance of <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>, since it results from an algebraic relation over pairwise distances estimated individually, whose covariances are therefore unknown. There are indirect ways of estimating that variance, through the sampling distribution when doing simulation such as the one mentioned above, or bootstrapping when handling real data. However, such procedures are very time consuming. To overcome this problem, we devised a numerical approximation of <it>&#963;</it><sup>2</sup>(<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>) as function of the pairwise distance estimates.</p>
         </sec>
         <sec>
            <st>
               <p>Numerical approximation of <it>&#963;</it><sup>2</sup>(<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>)</p>
            </st>
            <p>In essence, the numerical approximation described here was obtained through regression over a large number of samples. We settled for this approach after discovering that the analytical solution to this problem, even when using a simpler model of evolution (all amino-acid mutations with equal probability). requires solving a polynomial of degree 23. The details of this investigation are reported in the <it>Appendix</it>. In view of this inherent complexity, the regression cannot be exact, but it turns out to be a surprisingly precise numerical approximation for comparisons that involve proteins that have an evolutionary distance smaller than 250 PAM units, which corresponds to percentage sequence identity greater or equal to 19.68%. We generated random triplets in the following way: a random-length (uniform 100..500) sequence was chosen as the origin <it>O</it>. Three random PAM distances (uniform 1..125) were selected for <it>d</it><sub><it>OX</it></sub>, <it>d</it><sub><it>OY </it></sub>and <it>d</it><sub><it>OZ</it></sub>. The sequence <it>O </it>was mutated according to these distances to obtain <it>X</it>,<it>Y </it>and <it>Z</it>, our triplet. We generated about 30,000 triplets for three types of scoring matrix: updated Dayhoff matrices <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, DNA for coding genes and JTT <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. The DNA scoring matrices were computed from a very large set of entire coding gene alignments from mammals. It is used in the OMA project <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> to align entire coding genes and is based on a 4-symbol alphabet. For each triplet, we computed pairwise distance estimates and their variances as input for the approximation. Given that <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise </it></sub>is almost as powerful as <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>triplet</it></sub>, we computed and used <it>&#963;</it><sup>2 </sup>(<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>triplet</it></sub>) as reference value for <it>&#963;</it><sup>2</sup>(<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>).</p>
            <p>We examined a large number of regressions and one approximation stood out of the rest due to its efficiency, low average error and other minor indications. Table <tblr tid="T1">1</tblr> shows the coefficients of the approximation for the three types of scoring matrices.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Coefficient of the approximation of <it>&#963;</it><sup>2</sup>(<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>)</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c ca="left">
                        <p>Type</p>
                     </c>
                     <c ca="center">
                        <p><m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>XY </it></sub>+ <m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>XZ</it></sub></p>
                     </c>
                     <c ca="center">
                        <p><it>&#963;</it><sup>2</sup>(<m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>XY</it></sub>) + <it>&#963;</it><sup>2</sup>(<m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>XZ</it></sub>)</p>
                     </c>
                     <c ca="center">
                        <p>
                           <m:math name="1471-2105-7-529-i3" xmlns:m="http://www.w3.org/1998/Math/MathML">
                              <m:semantics>
                                 <m:mrow>
                                    <m:msubsup>
                                       <m:mover accent="true">
                                          <m:mi>d</m:mi>
                                          <m:mo>^</m:mo>
                                       </m:mover>
                                       <m:mrow>
                                          <m:mi>Y</m:mi>
                                          <m:mi>Z</m:mi>
                                       </m:mrow>
                                       <m:mn>2</m:mn>
                                    </m:msubsup>
                                 </m:mrow>
                                 <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcamaaDaaaleaacqWGzbqwcqWGAbGwaeaacqaIYaGmaaaaaa@31A4@</m:annotation>
                              </m:semantics>
                           </m:math>
                        </p>
                     </c>
                     <c ca="center">
                        <p><it>&#963;</it><sup>2</sup>(<m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>YZ</it></sub>)</p>
                     </c>
                     <c ca="center">
                        <p><it>&#963;</it><sup>2</sup>(<m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>XY</it></sub>) <it>&#963;</it><sup>2</sup>(<m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>XZ</it></sub>)</p>
                     </c>
                     <c ca="center">
                        <p>error</p>
                     </c>
                     <c ca="center">
                        <p>dim</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Day</p>
                     </c>
                     <c ca="center">
                        <p>-1.3090</p>
                     </c>
                     <c ca="center">
                        <p>1.0435</p>
                     </c>
                     <c ca="center">
                        <p>0.6895</p>
                     </c>
                     <c ca="center">
                        <p>-0.3339</p>
                     </c>
                     <c ca="center">
                        <p>0.1590</p>
                     </c>
                     <c ca="center">
                        <p>0.087</p>
                     </c>
                     <c ca="center">
                        <p>2.13</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>DNA</p>
                     </c>
                     <c ca="center">
                        <p>-1.2449</p>
                     </c>
                     <c ca="center">
                        <p>1.0933</p>
                     </c>
                     <c ca="center">
                        <p>0.6591</p>
                     </c>
                     <c ca="center">
                        <p>-0.3026</p>
                     </c>
                     <c ca="center">
                        <p>0.1181</p>
                     </c>
                     <c ca="center">
                        <p>0.098</p>
                     </c>
                     <c ca="center">
                        <p>2.13</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>JTT</p>
                     </c>
                     <c ca="center">
                        <p>-1.2921</p>
                     </c>
                     <c ca="center">
                        <p>1.0978</p>
                     </c>
                     <c ca="center">
                        <p>0.6741</p>
                     </c>
                     <c ca="center">
                        <p>-0.3065</p>
                     </c>
                     <c ca="center">
                        <p>0.1144</p>
                     </c>
                     <c ca="center">
                        <p>0.080</p>
                     </c>
                     <c ca="center">
                        <p>2.10</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Coefficients of the regression on the logarithms for the three types of scoring matrices. The error column shows the mean error, which by virtue of being a regression on logarithms is very close to the relative error.</p>
               </tblfn>
            </tbl>
            <p>For example, the approximation for DNA variances is</p>
            <p>
               <m:math name="1471-2105-7-529-i4" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mtable>
                           <m:mtr>
                              <m:mtd>
                                 <m:mrow>
                                    <m:msup>
                                       <m:mover accent="true">
                                          <m:mi>&#963;</m:mi>
                                          <m:mo>&#732;</m:mo>
                                       </m:mover>
                                       <m:mn>2</m:mn>
                                    </m:msup>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:msub>
                                       <m:mover accent="true">
                                          <m:mi>&#916;</m:mi>
                                          <m:mo>^</m:mo>
                                       </m:mover>
                                       <m:mrow>
                                          <m:mi>p</m:mi>
                                          <m:mi>a</m:mi>
                                          <m:mi>i</m:mi>
                                          <m:mi>r</m:mi>
                                          <m:mi>w</m:mi>
                                          <m:mi>i</m:mi>
                                          <m:mi>s</m:mi>
                                          <m:mi>e</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                    <m:mo>=</m:mo>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:msubsup>
                                             <m:mover accent="true">
                                                <m:mi>d</m:mi>
                                                <m:mo>^</m:mo>
                                             </m:mover>
                                             <m:mrow>
                                                <m:mi>Y</m:mi>
                                                <m:mi>Z</m:mi>
                                             </m:mrow>
                                             <m:mrow>
                                                <m:mn>1.3182</m:mn>
                                             </m:mrow>
                                          </m:msubsup>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:msup>
                                             <m:mi>&#963;</m:mi>
                                             <m:mn>2</m:mn>
                                          </m:msup>
                                          <m:msup>
                                             <m:mrow>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:msub>
                                                   <m:mover accent="true">
                                                      <m:mi>d</m:mi>
                                                      <m:mo>^</m:mo>
                                                   </m:mover>
                                                   <m:mrow>
                                                      <m:mi>Y</m:mi>
                                                      <m:mi>Z</m:mi>
                                                   </m:mrow>
                                                </m:msub>
                                                <m:mo stretchy="false">)</m:mo>
                                             </m:mrow>
                                             <m:mrow>
                                                <m:mn>0.3026</m:mn>
                                             </m:mrow>
                                          </m:msup>
                                       </m:mrow>
                                    </m:mfrac>
                                    <m:mo>.</m:mo>
                                 </m:mrow>
                              </m:mtd>
                           </m:mtr>
                           <m:mtr>
                              <m:mtd>
                                 <m:mrow>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:msup>
                                             <m:mrow>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:msup>
                                                   <m:mi>&#963;</m:mi>
                                                   <m:mn>2</m:mn>
                                                </m:msup>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:msub>
                                                   <m:mover accent="true">
                                                      <m:mi>d</m:mi>
                                                      <m:mo>^</m:mo>
                                                   </m:mover>
                                                   <m:mrow>
                                                      <m:mi>X</m:mi>
                                                      <m:mi>Y</m:mi>
                                                   </m:mrow>
                                                </m:msub>
                                                <m:mo stretchy="false">)</m:mo>
                                                <m:mo>+</m:mo>
                                                <m:msup>
                                                   <m:mi>&#963;</m:mi>
                                                   <m:mn>2</m:mn>
                                                </m:msup>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:msub>
                                                   <m:mover accent="true">
                                                      <m:mi>d</m:mi>
                                                      <m:mo>^</m:mo>
                                                   </m:mover>
                                                   <m:mrow>
                                                      <m:mi>X</m:mi>
                                                      <m:mi>Z</m:mi>
                                                   </m:mrow>
                                                </m:msub>
                                                <m:mo stretchy="false">)</m:mo>
                                                <m:mo stretchy="false">)</m:mo>
                                             </m:mrow>
                                             <m:mrow>
                                                <m:mn>1.0933</m:mn>
                                             </m:mrow>
                                          </m:msup>
                                          <m:msup>
                                             <m:mrow>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:msup>
                                                   <m:mi>&#963;</m:mi>
                                                   <m:mn>2</m:mn>
                                                </m:msup>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:msub>
                                                   <m:mover accent="true">
                                                      <m:mi>d</m:mi>
                                                      <m:mo>^</m:mo>
                                                   </m:mover>
                                                   <m:mrow>
                                                      <m:mi>X</m:mi>
                                                      <m:mi>Y</m:mi>
                                                   </m:mrow>
                                                </m:msub>
                                                <m:mo stretchy="false">)</m:mo>
                                                <m:msup>
                                                   <m:mi>&#963;</m:mi>
                                                   <m:mn>2</m:mn>
                                                </m:msup>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:msub>
                                                   <m:mover accent="true">
                                                      <m:mi>d</m:mi>
                                                      <m:mo>^</m:mo>
                                                   </m:mover>
                                                   <m:mrow>
                                                      <m:mi>X</m:mi>
                                                      <m:mi>Z</m:mi>
                                                   </m:mrow>
                                                </m:msub>
                                                <m:mo stretchy="false">)</m:mo>
                                                <m:mo stretchy="false">)</m:mo>
                                             </m:mrow>
                                             <m:mrow>
                                                <m:mn>0.1181</m:mn>
                                             </m:mrow>
                                          </m:msup>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:msup>
                                             <m:mrow>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:msub>
                                                   <m:mover accent="true">
                                                      <m:mi>d</m:mi>
                                                      <m:mo>^</m:mo>
                                                   </m:mover>
                                                   <m:mrow>
                                                      <m:mi>X</m:mi>
                                                      <m:mi>Y</m:mi>
                                                   </m:mrow>
                                                </m:msub>
                                                <m:mo>+</m:mo>
                                                <m:msub>
                                                   <m:mover accent="true">
                                                      <m:mi>d</m:mi>
                                                      <m:mo>^</m:mo>
                                                   </m:mover>
                                                   <m:mrow>
                                                      <m:mi>X</m:mi>
                                                      <m:mi>Z</m:mi>
                                                   </m:mrow>
                                                </m:msub>
                                                <m:mo stretchy="false">)</m:mo>
                                             </m:mrow>
                                             <m:mrow>
                                                <m:mn>1.2449</m:mn>
                                             </m:mrow>
                                          </m:msup>
                                       </m:mrow>
                                    </m:mfrac>
                                 </m:mrow>
                              </m:mtd>
                           </m:mtr>
                        </m:mtable>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaafaqabeGabaaabaacciGaf83WdmNbaGaadaahaaWcbeqaaiabikdaYaaakiabcIcaOiqbfs5aezaajaWaaSbaaSqaaiabdchaWjabdggaHjabdMgaPjabdkhaYjabdEha3jabdMgaPjabdohaZjabdwgaLbqabaGccqGGPaqkcqGH9aqpdaWcaaqaaiqbdsgaKzaajaWaa0baaSqaaiabdMfazjabdQfaAbqaaiabigdaXiabc6caUiabiodaZiabigdaXiabiIda4iabikdaYaaaaOqaaiab=n8aZnaaCaaaleqabaGaeGOmaidaaOGaeiikaGIafmizaqMbaKaadaWgaaWcbaGaemywaKLaemOwaOfabeaakiabcMcaPmaaCaaaleqabaGaeGimaaJaeiOla4IaeG4mamJaeGimaaJaeGOmaiJaeGOnaydaaaaakiabc6caUaqaamaalaaabaGaeiikaGIae83Wdm3aaWbaaSqabeaacqaIYaGmaaGccqGGOaakcuWGKbazgaqcamaaBaaaleaacqWGybawcqWGzbqwaeqaaOGaeiykaKIaey4kaSIae83Wdm3aaWbaaSqabeaacqaIYaGmaaGccqGGOaakcuWGKbazgaqcamaaBaaaleaacqWGybawcqWGAbGwaeqaaOGaeiykaKIaeiykaKYaaWbaaSqabeaacqaIXaqmcqGGUaGlcqaIWaamcqaI5aqocqaIZaWmcqaIZaWmaaGccqGGOaakcqWFdpWCdaahaaWcbeqaaiabikdaYaaakiabcIcaOiqbdsgaKzaajaWaaSbaaSqaaiabdIfayjabdMfazbqabaGccqGGPaqkcqWFdpWCdaahaaWcbeqaaiabikdaYaaakiabcIcaOiqbdsgaKzaajaWaaSbaaSqaaiabdIfayjabdQfaAbqabaGccqGGPaqkcqGGPaqkdaahaaWcbeqaaiabicdaWiabc6caUiabigdaXiabigdaXiabiIda4iabigdaXaaaaOqaaiabcIcaOiqbdsgaKzaajaWaaSbaaSqaaiabdIfayjabdMfazbqabaGccqGHRaWkcuWGKbazgaqcamaaBaaaleaacqWGybawcqWGAbGwaeqaaOGaeiykaKYaaWbaaSqabeaacqaIXaqmcqGGUaGlcqaIYaGmcqaI0aancqaI0aancqaI5aqoaaaaaaaaaaa@9B87@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>Readers familiar with numerical analysis will find an analogy between the approximation presented here and standard approximations for transcendental functions. For example, it is customary to approximate <it>exp</it>(<it>x</it>) through a quotient of polynomials <it>p</it>(<it>x</it>)<it>/q</it>(<it>x</it>), for some limited range of x.</p>
            <p>The relative error is in all the three cases less than 10%. Furthermore, since we normally use the square root of the variance, the relative error is in such cases half of the indicated. The last column indicates the dimension of the approximation which should be 2 in perfect conditions, and is indeed quite close.</p>
            <p>The fact that very different matrices have very similar coefficients, the low error and the almost correct dimensionality reassures us of the quality of the approximation.</p>
            <p>To test the accuracy/applicability of the approximation, as well as the other two methods to obtain the variance, we compared the 95 and 99% confidence level obtained using the appropriate number of standard deviations to the actual percentage of correct decisions obtained in a simulation over 400, 000 protein triplets generated as described above. The results are shown in Table <tblr tid="T2">2</tblr>.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Verification of accuracy of confidence intervals</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p><it>k </it>= 1.960</p>
                     </c>
                     <c ca="center">
                        <p><it>k </it>= 2.576</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>|<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>triplet </it></sub>- &#916;| > <it>k</it>&#183;<it>&#963;</it>(<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>triplet</it></sub>)</p>
                     </c>
                     <c ca="center">
                        <p>0.95129 &#177; 0.00067</p>
                     </c>
                     <c ca="center">
                        <p>0.99062 &#177; 0.00030</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>|<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise </it></sub>- &#916;| > <it>k</it>&#183;<it>&#963;</it><sub><it>bootstrap</it></sub>(<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>)</p>
                     </c>
                     <c ca="center">
                        <p>0.9511 &#177; 0.0020</p>
                     </c>
                     <c ca="center">
                        <p>0.99001 &#177; 0.00091</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>|<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise </it></sub>- &#916;| > <it>k</it>&#183;<it>&#963;</it>(<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>triplet</it></sub>)</p>
                     </c>
                     <c ca="center">
                        <p>0.94641 &#177; 0.00070</p>
                     </c>
                     <c ca="center">
                        <p>0.98896 &#177; 0.00032</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>|<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise </it></sub>- &#916;| > <it>k</it>&#183;<m:math name="1471-2105-7-529-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#963;</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWFdpWCgaacaaaa@2E85@</m:annotation></m:semantics></m:math>(<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>)</p>
                     </c>
                     <c ca="center">
                        <p>0.94808 &#177; 0.00069</p>
                     </c>
                     <c ca="center">
                        <p>0.98953 &#177; 0.00032</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>|<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise </it></sub>- &#916;| > <it>k</it>&#183;<it>&#963;</it><sub><it>ind</it></sub>(<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>)</p>
                     </c>
                     <c ca="center">
                        <p>0.98137 &#177; 0.00042</p>
                     </c>
                     <c ca="center">
                        <p>0.99774 &#177; 0.00015</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Comparison among the different methods to estimate the variance of the two estimators <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>triplet </it></sub>and <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>, resulting from a simulation using updated Dayhoff matrices over 400,000 proteins triplets, except for the bootstrapping method, based on 40,000 samples. The first column tests the 95% confidence interval, the second the 99% confidence interval.</p>
               </tblfn>
            </tbl>
            <p>As expected, the ML estimator over the entire triplet (first row) yields a precise variance estimate. On the other hand, we see that assuming independence for the estimation of the variance (last row) leads to very inaccurate confidence intervals. Estimating the variance of <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise </it></sub>by bootstrapping (10,000 re-samples) gives good confidence intervals, but the procedure is even more computationally intensive than <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>triplet</it></sub>, and therefore of little practical use in the present context. Using <m:math name="1471-2105-7-529-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#963;</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWFdpWCgaacaaaa@2E85@</m:annotation></m:semantics></m:math><sup>2</sup>(<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>) in conjunction with the variance of the ML estimator works remarkably well (third and fourth row). And surprisingly, applying the numerical approximation (fourth row) happened to give slightly more accurate results than the exact triplet variance (third row).</p>
            <p>Finally, we compared the different estimators on real biological sequences, using data obtained from the OMA orthologs project <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, Triplets of orthologous sequences from various eukaryotes were randomly selected and aligned using the multiple sequence alignment package from <it>Darwin </it><abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. All positions containing gaps were excluded, and variances were then estimated on the ungapped triplets using the various estimators (Fig. <figr fid="F2">2</figr>). The variance estimates from the approximation formula deviate very little from the results obtained by the two more expensive methods &#8211; for simulated as well as empirical alignments. Additionally, the plots illustrate the high correspondence between the results from the ML estimation and the bootstrapping, and show that the estimator based on an assumption of independence often yields overestimates of the variance. The difference between simulated and empirical data probably arises from the limitations of the Markovian model of evolution. Worth noticing is that the agreement of our estimator with bootstrapping is comparable to the one of the ML variance estimator: this implies that our approximation has a similar robustness when applied to real data.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Scatter plots comparing the variance estimators</p>
               </caption>
               <text>
                  <p><b>Scatter plots comparing the variance estimators</b>. The upper-left plot shows the strong agreement between <it>&#963;</it><sup>2</sup>(<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>triplet</it></sub>) and our approximation <it>&#963;</it><sup>2</sup>(<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>). From the upper-right and the lower-left plots, it can be seen that both have similar correlation with <m:math name="1471-2105-7-529-i6" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mrow><m:mi>b</m:mi><m:mi>o</m:mi><m:mi>o</m:mi><m:mi>t</m:mi><m:mi>s</m:mi><m:mi>t</m:mi><m:mi>r</m:mi><m:mi>a</m:mi><m:mi>p</m:mi></m:mrow><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaemOyaiMaem4Ba8Maem4Ba8MaemiDaqNaem4CamNaemiDaqNaemOCaiNaemyyaeMaemiCaahabaGaeGOmaidaaaaa@3C22@</m:annotation></m:semantics></m:math>(<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>). Finally, the lower-right plot confirms that variance estimation under the assumption of independence can yield a large overestimation of the correct variance.</p>
               </text>
               <graphic file="1471-2105-7-529-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Applications</p>
            </st>
            <p>In the following, we provide two examples of applications that benefit from the increase in statistical power of the estimator <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise </it></sub>enabled by the approximation: detection of asymmetric evolution and identification of the closest relative in a set of homologs. Furthermore, in <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, we show how our result can be used in the context of paralogy detection.</p>
            <p>We first define three indicator functions that will be used in these comparisons. They decide whether the pair of proteins <it>X</it>, <it>Y </it>is significantly closer than <it>X</it>, <it>Z </it>at the confidence level expressed by the number of standard deviations <it>k</it>. The first and second ones both use the estimator <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>, but the first definition uses as variance of the estimate the upper bound that is obtained by assuming independence of <m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>XY </it></sub>and <m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>XZ </it></sub>(see <it>Methods</it>), whereas the second use the approximation <m:math name="1471-2105-7-529-i5" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#963;</m:mi><m:mo>&#732;</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacuWFdpWCgaacaaaa@2E85@</m:annotation></m:semantics></m:math><sup>2</sup>(<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>) of the variance. The third indicator function uses the estimator <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>triplet</it></sub>.</p>
            <p>
               <m:math name="1471-2105-7-529-i7" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mi>c</m:mi>
                        <m:mi>l</m:mi>
                        <m:mi>o</m:mi>
                        <m:mi>s</m:mi>
                        <m:mi>e</m:mi>
                        <m:msub>
                           <m:mi>r</m:mi>
                           <m:mrow>
                              <m:mi>i</m:mi>
                              <m:mi>n</m:mi>
                              <m:mi>d</m:mi>
                           </m:mrow>
                        </m:msub>
                        <m:mo stretchy="false">(</m:mo>
                        <m:mi>X</m:mi>
                        <m:mo>,</m:mo>
                        <m:mi>Y</m:mi>
                        <m:mo>,</m:mo>
                        <m:mi>Z</m:mi>
                        <m:mo>,</m:mo>
                        <m:mi>k</m:mi>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mo>=</m:mo>
                        <m:mrow>
                           <m:mo>{</m:mo>
                           <m:mrow>
                              <m:mtable columnalign="left">
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mtext>true</m:mtext>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mtext>if&#160;</m:mtext>
                                          <m:msub>
                                             <m:mover accent="true">
                                                <m:mi>&#916;</m:mi>
                                                <m:mo>^</m:mo>
                                             </m:mover>
                                             <m:mrow>
                                                <m:mi>p</m:mi>
                                                <m:mi>a</m:mi>
                                                <m:mi>i</m:mi>
                                                <m:mi>r</m:mi>
                                                <m:mi>w</m:mi>
                                                <m:mi>i</m:mi>
                                                <m:mi>s</m:mi>
                                                <m:mi>e</m:mi>
                                             </m:mrow>
                                          </m:msub>
                                          <m:mo>&lt;</m:mo>
                                          <m:mo>&#8722;</m:mo>
                                          <m:mi>k</m:mi>
                                          <m:mo>&#8901;</m:mo>
                                          <m:msub>
                                             <m:mi>&#963;</m:mi>
                                             <m:mrow>
                                                <m:mi>i</m:mi>
                                                <m:mi>n</m:mi>
                                                <m:mi>d</m:mi>
                                             </m:mrow>
                                          </m:msub>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:msub>
                                             <m:mover accent="true">
                                                <m:mi>&#916;</m:mi>
                                                <m:mo>^</m:mo>
                                             </m:mover>
                                             <m:mrow>
                                                <m:mi>p</m:mi>
                                                <m:mi>a</m:mi>
                                                <m:mi>i</m:mi>
                                                <m:mi>r</m:mi>
                                                <m:mi>w</m:mi>
                                                <m:mi>i</m:mi>
                                                <m:mi>s</m:mi>
                                                <m:mi>e</m:mi>
                                             </m:mrow>
                                          </m:msub>
                                          <m:mo stretchy="false">)</m:mo>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mtext>false</m:mtext>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mtext>otherwise</m:mtext>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                              </m:mtable>
                           </m:mrow>
                        </m:mrow>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGJbWycqWGSbaBcqWGVbWBcqWGZbWCcqWGLbqzcqWGYbGCdaWgaaWcbaGaemyAaKMaemOBa4MaemizaqgabeaakiabcIcaOiabdIfayjabcYcaSiabdMfazjabcYcaSiabdQfaAjabcYcaSiabdUgaRjabcMcaPiabg2da9maaceqabaqbaeaabiGaaaqaaiabbsha0jabbkhaYjabbwha1jabbwgaLbqaaiabbMgaPjabbAgaMjabbccaGiqbfs5aezaajaWaaSbaaSqaaiabdchaWjabdggaHjabdMgaPjabdkhaYjabdEha3jabdMgaPjabdohaZjabdwgaLbqabaGccqGH8aapcqGHsislcqWGRbWAcqGHflY1iiGacqWFdpWCdaWgaaWcbaGaemyAaKMaemOBa4MaemizaqgabeaakiabcIcaOiqbfs5aezaajaWaaSbaaSqaaiabdchaWjabdggaHjabdMgaPjabdkhaYjabdEha3jabdMgaPjabdohaZjabdwgaLbqabaGccqGGPaqkaeaacqqGMbGzcqqGHbqycqqGSbaBcqqGZbWCcqqGLbqzaeaacqqGVbWBcqqG0baDcqqGObaAcqqGLbqzcqqGYbGCcqqG3bWDcqqGPbqAcqqGZbWCcqqGLbqzaaaacaGL7baaaaa@87DB@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>
               <m:math name="1471-2105-7-529-i8" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mi>c</m:mi>
                        <m:mi>l</m:mi>
                        <m:mi>o</m:mi>
                        <m:mi>s</m:mi>
                        <m:mi>e</m:mi>
                        <m:msub>
                           <m:mi>r</m:mi>
                           <m:mrow>
                              <m:mi>a</m:mi>
                              <m:mi>p</m:mi>
                              <m:mi>p</m:mi>
                           </m:mrow>
                        </m:msub>
                        <m:mo stretchy="false">(</m:mo>
                        <m:mi>X</m:mi>
                        <m:mo>,</m:mo>
                        <m:mi>Y</m:mi>
                        <m:mo>,</m:mo>
                        <m:mi>Z</m:mi>
                        <m:mo>,</m:mo>
                        <m:mi>k</m:mi>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mo>=</m:mo>
                        <m:mrow>
                           <m:mo>{</m:mo>
                           <m:mrow>
                              <m:mtable columnalign="left">
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mtext>true</m:mtext>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mtext>if&#160;</m:mtext>
                                          <m:msub>
                                             <m:mover accent="true">
                                                <m:mi>&#916;</m:mi>
                                                <m:mo>^</m:mo>
                                             </m:mover>
                                             <m:mrow>
                                                <m:mi>p</m:mi>
                                                <m:mi>a</m:mi>
                                                <m:mi>i</m:mi>
                                                <m:mi>r</m:mi>
                                                <m:mi>w</m:mi>
                                                <m:mi>i</m:mi>
                                                <m:mi>s</m:mi>
                                                <m:mi>e</m:mi>
                                             </m:mrow>
                                          </m:msub>
                                          <m:mo>&lt;</m:mo>
                                          <m:mo>&#8722;</m:mo>
                                          <m:mi>k</m:mi>
                                          <m:mo>&#8901;</m:mo>
                                          <m:mover accent="true">
                                             <m:mi>&#963;</m:mi>
                                             <m:mo>&#732;</m:mo>
                                          </m:mover>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:msub>
                                             <m:mover accent="true">
                                                <m:mi>&#916;</m:mi>
                                                <m:mo>^</m:mo>
                                             </m:mover>
                                             <m:mrow>
                                                <m:mi>p</m:mi>
                                                <m:mi>a</m:mi>
                                                <m:mi>i</m:mi>
                                                <m:mi>r</m:mi>
                                                <m:mi>w</m:mi>
                                                <m:mi>i</m:mi>
                                                <m:mi>s</m:mi>
                                                <m:mi>e</m:mi>
                                             </m:mrow>
                                          </m:msub>
                                          <m:mo stretchy="false">)</m:mo>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mtext>false</m:mtext>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mtext>otherwise</m:mtext>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                              </m:mtable>
                           </m:mrow>
                        </m:mrow>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGJbWycqWGSbaBcqWGVbWBcqWGZbWCcqWGLbqzcqWGYbGCdaWgaaWcbaGaemyyaeMaemiCaaNaemiCaahabeaakiabcIcaOiabdIfayjabcYcaSiabdMfazjabcYcaSiabdQfaAjabcYcaSiabdUgaRjabcMcaPiabg2da9maaceqabaqbaeaabiGaaaqaaiabbsha0jabbkhaYjabbwha1jabbwgaLbqaaiabbMgaPjabbAgaMjabbccaGiqbfs5aezaajaWaaSbaaSqaaiabdchaWjabdggaHjabdMgaPjabdkhaYjabdEha3jabdMgaPjabdohaZjabdwgaLbqabaGccqGH8aapcqGHsislcqWGRbWAcqGHflY1iiGacuWFdpWCgaacaiabcIcaOiqbfs5aezaajaWaaSbaaSqaaiabdchaWjabdggaHjabdMgaPjabdkhaYjabdEha3jabdMgaPjabdohaZjabdwgaLbqabaGccqGGPaqkaeaacqqGMbGzcqqGHbqycqqGSbaBcqqGZbWCcqqGLbqzaeaacqqGVbWBcqqG0baDcqqGObaAcqqGLbqzcqqGYbGCcqqG3bWDcqqGPbqAcqqGZbWCcqqGLbqzaaaacaGL7baaaaa@83AF@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>
               <m:math name="1471-2105-7-529-i9" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mi>c</m:mi>
                        <m:mi>l</m:mi>
                        <m:mi>o</m:mi>
                        <m:mi>s</m:mi>
                        <m:mi>e</m:mi>
                        <m:msub>
                           <m:mi>r</m:mi>
                           <m:mrow>
                              <m:mi>t</m:mi>
                              <m:mi>r</m:mi>
                              <m:mi>i</m:mi>
                              <m:mi>p</m:mi>
                              <m:mi>l</m:mi>
                              <m:mi>e</m:mi>
                              <m:mi>t</m:mi>
                           </m:mrow>
                        </m:msub>
                        <m:mo stretchy="false">(</m:mo>
                        <m:mi>X</m:mi>
                        <m:mo>,</m:mo>
                        <m:mi>Y</m:mi>
                        <m:mo>,</m:mo>
                        <m:mi>Z</m:mi>
                        <m:mo>,</m:mo>
                        <m:mi>k</m:mi>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mo>=</m:mo>
                        <m:mrow>
                           <m:mo>{</m:mo>
                           <m:mrow>
                              <m:mtable columnalign="left">
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mtext>true</m:mtext>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mtext>if&#160;</m:mtext>
                                          <m:msub>
                                             <m:mover accent="true">
                                                <m:mi>&#916;</m:mi>
                                                <m:mo>^</m:mo>
                                             </m:mover>
                                             <m:mrow>
                                                <m:mi>t</m:mi>
                                                <m:mi>r</m:mi>
                                                <m:mi>i</m:mi>
                                                <m:mi>p</m:mi>
                                                <m:mi>l</m:mi>
                                                <m:mi>e</m:mi>
                                                <m:mi>t</m:mi>
                                             </m:mrow>
                                          </m:msub>
                                          <m:mo>&lt;</m:mo>
                                          <m:mo>&#8722;</m:mo>
                                          <m:mi>k</m:mi>
                                          <m:mo>&#8901;</m:mo>
                                          <m:mi>&#963;</m:mi>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:msub>
                                             <m:mover accent="true">
                                                <m:mi>&#916;</m:mi>
                                                <m:mo>^</m:mo>
                                             </m:mover>
                                             <m:mrow>
                                                <m:mi>t</m:mi>
                                                <m:mi>r</m:mi>
                                                <m:mi>i</m:mi>
                                                <m:mi>p</m:mi>
                                                <m:mi>l</m:mi>
                                                <m:mi>e</m:mi>
                                                <m:mi>t</m:mi>
                                             </m:mrow>
                                          </m:msub>
                                          <m:mo stretchy="false">)</m:mo>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                                 <m:mtr columnalign="left">
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mtext>false</m:mtext>
                                       </m:mrow>
                                    </m:mtd>
                                    <m:mtd columnalign="left">
                                       <m:mrow>
                                          <m:mtext>otherwise</m:mtext>
                                       </m:mrow>
                                    </m:mtd>
                                 </m:mtr>
                              </m:mtable>
                           </m:mrow>
                        </m:mrow>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGJbWycqWGSbaBcqWGVbWBcqWGZbWCcqWGLbqzcqWGYbGCdaWgaaWcbaGaemiDaqNaemOCaiNaemyAaKMaemiCaaNaemiBaWMaemyzauMaemiDaqhabeaakiabcIcaOiabdIfayjabcYcaSiabdMfazjabcYcaSiabdQfaAjabcYcaSiabdUgaRjabcMcaPiabg2da9maaceqabaqbaeaabiGaaaqaaiabbsha0jabbkhaYjabbwha1jabbwgaLbqaaiabbMgaPjabbAgaMjabbccaGiqbfs5aezaajaWaaSbaaSqaaiabdsha0jabdkhaYjabdMgaPjabdchaWjabdYgaSjabdwgaLjabdsha0bqabaGccqGH8aapcqGHsislcqWGRbWAcqGHflY1iiGacqWFdpWCcqGGOaakcuqHuoargaqcamaaBaaaleaacqWG0baDcqWGYbGCcqWGPbqAcqWGWbaCcqWGSbaBcqWGLbqzcqWG0baDaeqaaOGaeiykaKcabaGaeeOzayMaeeyyaeMaeeiBaWMaee4CamNaeeyzaugabaGaee4Ba8MaeeiDaqNaeeiAaGMaeeyzauMaeeOCaiNaee4DaCNaeeyAaKMaee4CamNaeeyzaugaaaGaay5Eaaaaaa@86B8@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <sec>
               <st>
                  <p>Asymmetric evolution</p>
               </st>
               <p>After a gene duplication, the two copies can evolve independently. It has been suggested that in many cases, one duplicate maintains the ancestral function while the other is free to evolve and acquire novel functionality <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. This scenario implies that the protein with conserved functionality will undergo less sequence evolution than the one exploring new functionalities.</p>
               <p>Detecting this asymmetric evolution after duplication is an important factor not only for function prediction or orthologs assignment, but also for bringing new insights in our understanding of genome evolution in general (e.g. <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>).</p>
               <p>In order to identify cases of asymmetric evolution, one typically considers three sequences &#8211; the two duplicates (<it>Y </it>and <it>Z</it>)and an out-group (<it>X</it>). Several methods have been developed to test the significance of the unequal lengths of the branches leading from the common ancestor to the two duplicated sequences. Tests on simulated and real data from <it>Arabidopsis thaliana </it>for two of such methods have suggested very low statistical power to detect asymmetric evolution of duplicates <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>.</p>
               <p>The <it>closer </it>indicator function can be used to detect asymmetric evolution. With <it>d</it><sub><it>XY </it></sub>being the distance from the out-group to the closer of the two duplicates and <it>d</it><sub><it>XZ </it></sub>the distance to the other one, <it>closer </it>(<it>X</it>, <it>Y</it>, <it>Z</it>, <it>k</it>) decides if the two duplicated proteins have evolved at significantly different rates. The parameter <it>k </it>can be chosen to reflect the confidence level, e.g. 1.96 for the 95% level.</p>
               <p>We tested the method using all three variants of <it>closer </it>(<it>k </it>= 1.96) on a protein set from a recent publication about whole genome duplication in <it>S. cerevisiae </it><abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. From a set of 450 genes pairs that arose by whole genome duplication, they report 115 cases of one paralog evolving at least 50% faster than the other paralog. The position of the ancestral gene was determined by an out-group gene from <it>K. waltii</it>. Additionally, a set of 76 gene pairs is given where at least one of the <it>S. cerevisiae </it>genes evolved at least 50% faster than the <it>K. waltii </it>homolog.</p>
               <p>The results are summarized in Fig. <figr fid="F3">3</figr>. We first discuss the differences among three variants of <it>closer</it>. As expected, the over estimation of the variance of the estimator in <it>closer</it><sub><it>ind </it></sub>considerably reduces the cases of asymmetry detected in comparison with <it>closer</it><sub><it>app</it></sub>. As for <it>closer</it><sub><it>app</it></sub>and <it>closer</it><sub><it>triplet</it></sub>, they agree on 400 of 450 cases, with 21 cases only reported by <it>closer </it><sub><it>app </it></sub>and 29 only by <it>closer </it><sub><it>triplet</it></sub>. This discrepancy results from the error introduced by the approximation for the estimation of the variance of <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>, but mostly from the inherent differences in the predictions of the two estimators <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise </it></sub>and <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>triplet</it></sub>.</p>
               <fig id="F3">
                  <title>
                     <p>Figure 3</p>
                  </title>
                  <caption>
                     <p>Detection of asymmetric evolution</p>
                  </caption>
                  <text>
                     <p><b>Detection of asymmetric evolution</b>. Detection of Asymmetric Evolution. Comparison between the results of Kellis et al. and the three variants of <it>closer</it>, with <it>k </it>= 1.96. The circles separate cases of significant asymmetry (inside) from insignificant asymmetry (outside). For instance, there were 92 cases where all three variants of <it>closer </it>reported significant asymmetry, while the method of Kellis et al. did not detect significant asymmetry.</p>
                  </text>
                  <graphic file="1471-2105-7-529-3"/>
               </fig>
               <p>If we now compare the predictions of Kellis and colleagues with our results, it appears that in 98 out of 115 cases, their prediction of asymmetric evolution could be confirmed by <it>closer</it><sub><it>app</it></sub>, while with the remaining 17 pairs, our method did not support the asymmetry prediction. It is remarkable, however, that all these 17 pairs belong to the group of the 76 pairs with a fast evolving <it>K. waltii </it>homolog. It seems likely that the uncertainty in placing the origin of the triplet (arising from a longer branch to the out-group) causes rate-based methods as used in <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> to report asymmetric divergence despite the unclear situation. As opposed to that, the distance-based methods presented here, by incorporating the variance of the estimates explicitly, take the uncertainty about the point of origin into account, and therefore give more conservative predictions in these cases.</p>
               <p>Furthermore, <it>closer</it><sub><it>app </it></sub>found 134 additional cases of asymmetry among the remaining 335 gene pairs in the data set. Together with the 98 cases above, this results in 51.6% of all genes arising from the genome duplication event. This is clearly more than the 5% that could be expected from random chance and agrees with previous studies were significant amounts of asymmetrically evolving duplicates have been reported (e.g. <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>).</p>
            </sec>
            <sec>
               <st>
                  <p>Closest homolog without phylogenetic reconstruction</p>
               </st>
               <p>The identification of the closest relative of a protein (or gene) in a set of homologs traditionally requires the reconstruction of the corresponding phylogenetic tree. However, building gene trees remains a time consuming and error-prone task, thus methods based on pairwise evolutionary distance estimates are attractive. In this section, we show that using the variance approximation presented above can boost the statistical power of PAM distance comparisons to determine the closest homolog.</p>
               <p>In simple contexts, or when accuracy is not a concern, the problem of identifying the closest relative can be solved reasonably well by coarse approaches, such as the top blast hit, or even the sequence with highest percentage identity. As the number of proteins grows larger and the number of homologs with similar distances increase, these methods show their limits. Indeed, it has been previously shown that the top blast hit is often not the closest relative <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. At least two ideas lead to better results: the use of evolutionary distance estimates such as PAM distances, and accounting for confidence intervals, so that whenever there is not enough information to reliably discriminate among several distances, all of them are kept, presumably for further analysis.</p>
               <p>Since the comparison of the methods requires precise and unbiased knowledge of the closest homolog, we use simulated data generated in the same way as in the section above, according to the PAM model. Families of homologs were created through mutation and duplication following random phylogenetic trees (Fig. <figr fid="F4">4</figr>) with the following properties: (i) each branch has a random mutation rate from a uniform distribution between 0 and 1, (ii) duplication occurs only along the leftmost branch, at random intervals, on average about every 6 PAM units, (iii) the generation is performed in 60 steps and results in trees with an average number of leaves of 13.04 (<it>&#963; </it>= 3.1). The very asymmetric duplication process is used to explore efficiently the parameter space, both in terms of distance magnitude to the closest homolog as in the number of homologs with similar distances.</p>
               <fig id="F4">
                  <title>
                     <p>Figure 4</p>
                  </title>
                  <caption>
                     <p>Tree randomly generated for closest homolog simulation</p>
                  </caption>
                  <text>
                     <p><b>Tree randomly generated for closest homolog simulation</b>. Example of a random tree (see text for description of the procedure) used to compare the different methods to infer the closest homolog to each leaf. Distances indicated are in PAM units.</p>
                  </text>
                  <graphic file="1471-2105-7-529-4"/>
               </fig>
               <p>For each protein <it>X </it>belonging to such a family, the closest homolog predictions using the following three criteria were compared to the actual closest homolog. The first one computes the subset of homologous sequences <it>H </it>that align with <it>X </it>with score higher than a particular fraction of the top score.</p>
               <p>
                  <m:math name="1471-2105-7-529-i10" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>S</m:mi>
                           <m:mi>e</m:mi>
                           <m:msub>
                              <m:mi>t</m:mi>
                              <m:mn>1</m:mn>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mo>{</m:mo>
                           <m:mi>Y</m:mi>
                           <m:mo>&#8712;</m:mo>
                           <m:mi>H</m:mi>
                           <m:mo>|</m:mo>
                           <m:mi>S</m:mi>
                           <m:mi>c</m:mi>
                           <m:mi>o</m:mi>
                           <m:mi>r</m:mi>
                           <m:mi>e</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>X</m:mi>
                           <m:mo>,</m:mo>
                           <m:mi>Y</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>&#8805;</m:mo>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mn>1</m:mn>
                           <m:mo>&#8722;</m:mo>
                           <m:msub>
                              <m:mi>k</m:mi>
                              <m:mn>1</m:mn>
                           </m:msub>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>&#8901;</m:mo>
                           <m:munder>
                              <m:mrow>
                                 <m:mi>max</m:mi>
                                 <m:mo>&#8289;</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mi>Z</m:mi>
                                 <m:mo>&#8712;</m:mo>
                                 <m:mi>H</m:mi>
                              </m:mrow>
                           </m:munder>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>S</m:mi>
                           <m:mi>c</m:mi>
                           <m:mi>o</m:mi>
                           <m:mi>r</m:mi>
                           <m:mi>e</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>X</m:mi>
                           <m:mo>,</m:mo>
                           <m:mi>Z</m:mi>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>}</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGtbWucqWGLbqzcqWG0baDdaWgaaWcbaGaeGymaedabeaakiabg2da9iabcUha7jabdMfazjabgIGiolabdIeaijabcYha8jabdofatjabdogaJjabd+gaVjabdkhaYjabdwgaLjabcIcaOiabdIfayjabcYcaSiabdMfazjabcMcaPiabgwMiZkabcIcaOiabigdaXiabgkHiTiabdUgaRnaaBaaaleaacqaIXaqmaeqaaOGaeiykaKIaeyyXIC9aaCbeaeaacyGGTbqBcqGGHbqycqGG4baEaSqaaiabdQfaAjabgIGiolabdIeaibqabaGccqGGOaakcqWGtbWucqWGJbWycqWGVbWBcqWGYbGCcqWGLbqzcqGGOaakcqWGybawcqGGSaalcqWGAbGwcqGGPaqkcqGGPaqkcqGG9bqFaaa@6694@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
               <p>The second method computes the set of closest homologs, without using our variance approximation, formally</p>
               <p><it>Set</it><sub>2 </sub>= {<it>Y </it>&#8712; <it>H </it>| &#8708; <it>Z </it>&#8712; <it>H</it>, <it>Z </it>&#8800; <it>Y</it>, <it>closer</it><sub><it>ind</it></sub>(<it>X</it>, <it>Z</it>, <it>Y</it>, <it>k</it><sub>2</sub>)}</p>
               <p>The third method computes the set of closest homologs using our approximation, formally</p>
               <p><it>Set</it><sub>3 </sub>= {<it>Y </it>&#8712; <it>H </it>| &#8708; <it>Y </it>&#8712; <it>H</it>, <it>Z </it>&#8800; <it>Y</it>, <it>closer</it><sub><it>app</it></sub>(<it>X</it>, <it>Z</it>, <it>Y</it>, <it>k</it><sub>3</sub>)}</p>
               <p>The cut-off parameters <it>k</it><sub>1</sub>, <it>k</it><sub>2</sub>, <it>k</it><sub>3 </sub>can be set according to the desired level of confidence. At <it>k </it>= 0, only the top score, respectively the shortest expected distance, is returned. Higher <it>k </it>values correspond to more conservative predictions, with increasing number of closest homolog candidates. For the evaluation of the methods, we vary <it>k</it><sub>1 </sub>between 0 and 0.25, while <it>k</it><sub>2</sub>, <it>k</it><sub>3 </sub>are varied between 0 and 3. Note that only <it>k</it><sub>3 </sub>corresponds to the number of standard deviations from the expected value.</p>
               <p>The resulting curves are presented in Fig. <figr fid="F5">5</figr>. At low cut-off values, all three methods perform similarly, but as <it>k </it>increases, the method using <it>closer</it><sub><it>app </it></sub>gives better results.</p>
               <fig id="F5">
                  <title>
                     <p>Figure 5</p>
                  </title>
                  <caption>
                     <p>Identification of the closest homolog</p>
                  </caption>
                  <text>
                     <p><b>Identification of the closest homolog</b>. Identification of the closest homolog: comparison between methods using alignment score (1), distance with assumption of independence (2) and distance using our variance approximation (3), on simulated data.</p>
                  </text>
                  <graphic file="1471-2105-7-529-5"/>
               </fig>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Computing the difference of two evolutionary distances that are not independent is a common operation in an increasing number of bioinformatics analyses. We presented and compared two estimators for the difference of two evolutionary distances in a triplet of homologs, one estimator based on pairwise distance estimates and the maximum likelihood estimator. Surprisingly, the estimator based on pairwise distance is almost as powerful as the ML estimator. But in terms of time complexity, it scales much better than the ML estimator and is therefore better suited at large-scale analyses. However, since its variance is not easy to estimate, we introduced a numerical approximation that allows the computation of accurate confidence intervals. Finally, we showed how these results can be used to test for asymmetrical evolution, and to identify the closest relative of a sequence in a group of homologs without phylogenetic reconstruction. As of future work, we plan to extend these results to models of evolution allowing rate variations, as well as insertion-deletions.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>PAM distance estimator for a pair</p>
            </st>
            <p>The likelihood of an alignment <it>A </it>at an evolutionary distance <it>d </it>is defined <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp> as</p>
            <p>
               <m:math name="1471-2105-7-529-i11" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mtable>
                           <m:mtr>
                              <m:mtd>
                                 <m:mrow>
                                    <m:mi>L</m:mi>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mi>A</m:mi>
                                    <m:mo>|</m:mo>
                                    <m:mi>d</m:mi>
                                    <m:mo stretchy="false">)</m:mo>
                                 </m:mrow>
                              </m:mtd>
                              <m:mtd>
                                 <m:mo>=</m:mo>
                              </m:mtd>
                              <m:mtd>
                                 <m:mrow>
                                    <m:mstyle displaystyle="true">
                                       <m:msub>
                                          <m:mo>&#8719;</m:mo>
                                          <m:mrow>
                                             <m:mo stretchy="false">[</m:mo>
                                             <m:mi>x</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mi>y</m:mi>
                                             <m:mo stretchy="false">]</m:mo>
                                             <m:mo>&#8712;</m:mo>
                                             <m:mi>A</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mrow>
                                          <m:mi>f</m:mi>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:mi>x</m:mi>
                                          <m:mo stretchy="false">)</m:mo>
                                          <m:msub>
                                             <m:mi>M</m:mi>
                                             <m:mrow>
                                                <m:mi>x</m:mi>
                                                <m:mo>,</m:mo>
                                                <m:mi>y</m:mi>
                                             </m:mrow>
                                          </m:msub>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:mi>d</m:mi>
                                          <m:mo stretchy="false">)</m:mo>
                                       </m:mrow>
                                    </m:mstyle>
                                 </m:mrow>
                              </m:mtd>
                           </m:mtr>
                           <m:mtr>
                              <m:mtd>
                                 <m:mrow/>
                              </m:mtd>
                              <m:mtd>
                                 <m:mo>=</m:mo>
                              </m:mtd>
                              <m:mtd>
                                 <m:mrow>
                                    <m:mstyle displaystyle="true">
                                       <m:msub>
                                          <m:mo>&#8719;</m:mo>
                                          <m:mrow>
                                             <m:mo stretchy="false">[</m:mo>
                                             <m:mi>x</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mi>y</m:mi>
                                             <m:mo stretchy="false">]</m:mo>
                                             <m:mo>&#8712;</m:mo>
                                             <m:mi>A</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:mrow>
                                          <m:mi>f</m:mi>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:mi>x</m:mi>
                                          <m:mo stretchy="false">)</m:mo>
                                          <m:msub>
                                             <m:mrow>
                                                <m:mrow>
                                                   <m:mo>[</m:mo>
                                                   <m:mrow>
                                                      <m:msup>
                                                         <m:mi>e</m:mi>
                                                         <m:mrow>
                                                            <m:mi>d</m:mi>
                                                            <m:mi>Q</m:mi>
                                                         </m:mrow>
                                                      </m:msup>
                                                   </m:mrow>
                                                   <m:mo>]</m:mo>
                                                </m:mrow>
                                             </m:mrow>
                                             <m:mrow>
                                                <m:mi>x</m:mi>
                                                <m:mo>,</m:mo>
                                                <m:mi>y</m:mi>
                                             </m:mrow>
                                          </m:msub>
                                       </m:mrow>
                                    </m:mstyle>
                                 </m:mrow>
                              </m:mtd>
                           </m:mtr>
                        </m:mtable>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaafaqadeGadaaabaGaemitaWKaeiikaGIaemyqaeKaeiiFaWNaemizaqMaeiykaKcabaGaeyypa0dabaWaaebeaeaacqWGMbGzcqGGOaakcqWG4baEcqGGPaqkcqWGnbqtdaWgaaWcbaGaemiEaGNaeiilaWIaemyEaKhabeaakiabcIcaOiabdsgaKjabcMcaPaWcbaGaei4waSLaemiEaGNaeiilaWIaemyEaKNaeiyxa0LaeyicI4SaemyqaeeabeqdcqGHpis1aaGcbaaabaGaeyypa0dabaWaaebeaeaacqWGMbGzcqGGOaakcqWG4baEcqGGPaqkdaWadaqaaiabdwgaLnaaCaaaleqabaGaemizaqMaemyuaefaaaGccaGLBbGaayzxaaWaaSbaaSqaaiabdIha4jabcYcaSiabdMha5bqabaaabaGaei4waSLaemiEaGNaeiilaWIaemyEaKNaeiyxa0LaeyicI4SaemyqaeeabeqdcqGHpis1aaaaaaa@660D@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>with <it>x </it>and <it>y </it>being aligned characters (e.g. amino acids, bases, but no deletions), and <it>f</it>(<it>x</it>) the background frequency of the character <it>x</it>. Maximizing <it>L</it>(<it>A </it>| <it>d</it>) in terms of <it>d </it>gives the ML estimator <m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math> of the evolutionary distance. This is usually done numerically using the Newton-Raphson method. The variance of the ML estimator <m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math> can be computed from the second derivative of the log-likelihood:</p>
            <p>
               <m:math name="1471-2105-7-529-i12" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:msup>
                           <m:mi>&#963;</m:mi>
                           <m:mn>2</m:mn>
                        </m:msup>
                        <m:mo stretchy="false">(</m:mo>
                        <m:mover accent="true">
                           <m:mi>d</m:mi>
                           <m:mo>^</m:mo>
                        </m:mover>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mo>=</m:mo>
                        <m:mo>&#8722;</m:mo>
                        <m:msup>
                           <m:mrow>
                              <m:mrow>
                                 <m:mo>(</m:mo>
                                 <m:mrow>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:msup>
                                             <m:mo>&#8706;</m:mo>
                                             <m:mn>2</m:mn>
                                          </m:msup>
                                          <m:mi>L</m:mi>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:mi>A</m:mi>
                                          <m:mo>|</m:mo>
                                          <m:mover accent="true">
                                             <m:mi>d</m:mi>
                                             <m:mo>^</m:mo>
                                          </m:mover>
                                          <m:mo stretchy="false">)</m:mo>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:mo>&#8706;</m:mo>
                                          <m:msup>
                                             <m:mi>d</m:mi>
                                             <m:mn>2</m:mn>
                                          </m:msup>
                                       </m:mrow>
                                    </m:mfrac>
                                 </m:mrow>
                                 <m:mo>)</m:mo>
                              </m:mrow>
                           </m:mrow>
                           <m:mrow>
                              <m:mo>&#8722;</m:mo>
                              <m:mn>1</m:mn>
                           </m:mrow>
                        </m:msup>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaahaaWcbeqaaiabikdaYaaakiabcIcaOiqbdsgaKzaajaGaeiykaKIaeyypa0JaeyOeI0YaaeWaaeaadaWcaaqaaiabgkGi2oaaCaaaleqabaGaeGOmaidaaOGaemitaWKaeiikaGIaemyqaeKaeiiFaWNafmizaqMbaKaacqGGPaqkaeaacqGHciITcqWGKbazdaahaaWcbeqaaiabikdaYaaaaaaakiaawIcacaGLPaaadaahaaWcbeqaaiabgkHiTiabigdaXaaaaaa@4576@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>Notice that the variance is obtained for free as it is already computed in Newton's iteration.</p>
         </sec>
         <sec>
            <st>
               <p>PAM distance estimator for a triplet</p>
            </st>
            <sec>
               <st>
                  <p>Estimator based on pairwise distances</p>
               </st>
               <p>One can estimate &#916; by performing pairwise alignments between <it>X </it>and <it>Y</it>, and between <it>X </it>and <it>Z</it>. The ML method for pairs of homologs, which was described above, computes the estimates <m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>XY </it></sub>and <m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>XZ</it></sub>. By subtracting the first from the second, an estimator for the difference is obtained:</p>
               <p><m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise </it></sub>= <m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>XY </it></sub>- <m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>XZ</it></sub></p>
               <p>Since the two pairwise distance estimators are asymptotically unbiased and normally distributed, and considering the linearity of the expected value and the fact that the difference of two normally distributed variables is also normally distributed, the pairwise estimator <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise </it></sub>is also asymptotically unbiased and normally distributed, with variance</p>
               <p>
                  <m:math name="1471-2105-7-529-i13" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msup>
                              <m:mi>&#963;</m:mi>
                              <m:mn>2</m:mn>
                           </m:msup>
                           <m:mo stretchy="false">(</m:mo>
                           <m:msub>
                              <m:mover accent="true">
                                 <m:mi>&#916;</m:mi>
                                 <m:mo>^</m:mo>
                              </m:mover>
                              <m:mrow>
                                 <m:mi>p</m:mi>
                                 <m:mi>a</m:mi>
                                 <m:mi>i</m:mi>
                                 <m:mi>r</m:mi>
                                 <m:mi>w</m:mi>
                                 <m:mi>i</m:mi>
                                 <m:mi>s</m:mi>
                                 <m:mi>e</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:msup>
                              <m:mi>&#963;</m:mi>
                              <m:mn>2</m:mn>
                           </m:msup>
                           <m:mo stretchy="false">(</m:mo>
                           <m:msub>
                              <m:mover accent="true">
                                 <m:mi>d</m:mi>
                                 <m:mo>^</m:mo>
                              </m:mover>
                              <m:mrow>
                                 <m:mi>X</m:mi>
                                 <m:mi>Y</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>+</m:mo>
                           <m:msup>
                              <m:mi>&#963;</m:mi>
                              <m:mn>2</m:mn>
                           </m:msup>
                           <m:mo stretchy="false">(</m:mo>
                           <m:msub>
                              <m:mover accent="true">
                                 <m:mi>d</m:mi>
                                 <m:mo>^</m:mo>
                              </m:mover>
                              <m:mrow>
                                 <m:mi>X</m:mi>
                                 <m:mi>Z</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>&#8722;</m:mo>
                           <m:mn>2</m:mn>
                           <m:mi>c</m:mi>
                           <m:mi>o</m:mi>
                           <m:mi>v</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:msub>
                              <m:mover accent="true">
                                 <m:mi>d</m:mi>
                                 <m:mo>^</m:mo>
                              </m:mover>
                              <m:mrow>
                                 <m:mi>X</m:mi>
                                 <m:mi>Y</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>,</m:mo>
                           <m:msub>
                              <m:mover accent="true">
                                 <m:mi>d</m:mi>
                                 <m:mo>^</m:mo>
                              </m:mover>
                              <m:mrow>
                                 <m:mi>X</m:mi>
                                 <m:mi>Z</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo stretchy="false">)</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaahaaWcbeqaaiabikdaYaaakiabcIcaOiqbfs5aezaajaWaaSbaaSqaaiabdchaWjabdggaHjabdMgaPjabdkhaYjabdEha3jabdMgaPjabdohaZjabdwgaLbqabaGccqGGPaqkcqGH9aqpcqWFdpWCdaahaaWcbeqaaiabikdaYaaakiabcIcaOiqbdsgaKzaajaWaaSbaaSqaaiabdIfayjabdMfazbqabaGccqGGPaqkcqGHRaWkcqWFdpWCdaahaaWcbeqaaiabikdaYaaakiabcIcaOiqbdsgaKzaajaWaaSbaaSqaaiabdIfayjabdQfaAbqabaGccqGGPaqkcqGHsislcqaIYaGmieGacqGFJbWycqGFVbWBcqGF2bGDcqGGOaakcuWGKbazgaqcamaaBaaaleaacqWGybawcqWGzbqwaeqaaOGaeiilaWIafmizaqMbaKaadaWgaaWcbaGaemiwaGLaemOwaOfabeaakiabcMcaPaaa@61EF@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
               <p>As described above, we obtain <it>&#963;</it><sup>2</sup>(<m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>XY</it></sub>) and <it>&#963;</it><sup>2</sup>(<m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>XZ</it></sub>) from the ML distance estimation, but the process does not say anything about their covariance. If the two distances are independent, which is only the case if <it>d</it><sub><it>OX </it></sub>= 0, the covariance is zero and the variance <m:math name="1471-2105-7-529-i14" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:msubsup><m:mi>&#963;</m:mi><m:mrow><m:mi>i</m:mi><m:mi>n</m:mi><m:mi>d</m:mi></m:mrow><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaqhaaWcbaGaemyAaKMaemOBa4MaemizaqgabaGaeGOmaidaaaaa@33A6@</m:annotation></m:semantics></m:math>(<m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>pairwise</it></sub>) = <it>&#963;</it><sup>2</sup>(<m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>XY</it></sub>) + <it>&#963;</it><sup>2</sup>(<m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>XZ</it></sub>) can be computed. In all other cases, <m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>XY </it></sub>and <m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>XZ </it></sub>covary and the variance of their difference is smaller than the sum of their variances. Therefore, we only have an upper bound for the variance of our estimator:</p>
               <p>
                  <m:math name="1471-2105-7-529-i15" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msup>
                              <m:mi>&#963;</m:mi>
                              <m:mn>2</m:mn>
                           </m:msup>
                           <m:mo stretchy="false">(</m:mo>
                           <m:msub>
                              <m:mover accent="true">
                                 <m:mi>&#916;</m:mi>
                                 <m:mo>^</m:mo>
                              </m:mover>
                              <m:mrow>
                                 <m:mi>p</m:mi>
                                 <m:mi>a</m:mi>
                                 <m:mi>i</m:mi>
                                 <m:mi>r</m:mi>
                                 <m:mi>w</m:mi>
                                 <m:mi>i</m:mi>
                                 <m:mi>s</m:mi>
                                 <m:mi>e</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>&#8804;</m:mo>
                           <m:msubsup>
                              <m:mi>&#963;</m:mi>
                              <m:mrow>
                                 <m:mi>i</m:mi>
                                 <m:mi>n</m:mi>
                                 <m:mi>d</m:mi>
                              </m:mrow>
                              <m:mn>2</m:mn>
                           </m:msubsup>
                           <m:mo stretchy="false">(</m:mo>
                           <m:msub>
                              <m:mover accent="true">
                                 <m:mi>&#916;</m:mi>
                                 <m:mo>^</m:mo>
                              </m:mover>
                              <m:mrow>
                                 <m:mi>p</m:mi>
                                 <m:mi>a</m:mi>
                                 <m:mi>i</m:mi>
                                 <m:mi>r</m:mi>
                                 <m:mi>w</m:mi>
                                 <m:mi>i</m:mi>
                                 <m:mi>s</m:mi>
                                 <m:mi>e</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo stretchy="false">)</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaaiiGacqWFdpWCdaahaaWcbeqaaiabikdaYaaakiabcIcaOiqbfs5aezaajaWaaSbaaSqaaiabdchaWjabdggaHjabdMgaPjabdkhaYjabdEha3jabdMgaPjabdohaZjabdwgaLbqabaGccqGGPaqkcqGHKjYOcqWFdpWCdaqhaaWcbaGaemyAaKMaemOBa4MaemizaqgabaGaeGOmaidaaOGaeiikaGIafuiLdqKbaKaadaWgaaWcbaGaemiCaaNaemyyaeMaemyAaKMaemOCaiNaem4DaCNaemyAaKMaem4CamNaemyzaugabeaakiabcMcaPaaa@5528@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
               <p>Note that previous work on covariance estimation (e.g. <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B28">28</abbr></abbrgrp>) do not apply here, because they require 3-way sequence alignments and are constrained to parametric models of evolution such as Jukes-Cantor and its generalizations.</p>
            </sec>
            <sec>
               <st>
                  <p>Estimator based on triplet</p>
               </st>
               <p>Alternatively, we can estimate &#916; by subtracting estimates of the distances <it>d</it><sub><it>OY </it></sub>and <it>d</it><sub><it>OZ</it></sub></p>
               <p><m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>triplet </it></sub>= <m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>OY </it></sub>- <m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>OZ</it></sub></p>
               <p>The estimates <m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>OY </it></sub>and <m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>OZ </it></sub>can be obtained by maximum likelihood over the multiple sequence alignment of <it>X</it>, <it>Y</it>, <it>Z </it><abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, in a manner analogous to the ML estimation for a pair. The likelihood <it>L </it>of a multiple sequence alignment (MSA) is the product, over all positions of the MSA, of the probability of observing characters <it>x</it>, <it>y</it>, <it>z </it>at distance <it>d</it><sub><it>OX</it></sub>, <it>d</it><sub><it>OY</it></sub>, <it>d</it><sub><it>OZ </it></sub>of the origin, where such a probability is obtained by marginalizing over every character <it>o </it>at the origin:</p>
               <p>
                  <m:math name="1471-2105-7-529-i16" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>L</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>M</m:mi>
                           <m:mi>S</m:mi>
                           <m:mi>A</m:mi>
                           <m:mo>|</m:mo>
                           <m:msub>
                              <m:mi>d</m:mi>
                              <m:mrow>
                                 <m:mi>O</m:mi>
                                 <m:mi>X</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>,</m:mo>
                           <m:msub>
                              <m:mi>d</m:mi>
                              <m:mrow>
                                 <m:mi>O</m:mi>
                                 <m:mi>Y</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>,</m:mo>
                           <m:msub>
                              <m:mi>d</m:mi>
                              <m:mrow>
                                 <m:mi>O</m:mi>
                                 <m:mi>Z</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munder>
                                 <m:mo>&#8719;</m:mo>
                                 <m:mrow>
                                    <m:mo stretchy="false">[</m:mo>
                                    <m:mi>x</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:mi>y</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:mi>z</m:mi>
                                    <m:mo stretchy="false">]</m:mo>
                                 </m:mrow>
                              </m:munder>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:munder>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>o</m:mi>
                                          <m:mo>&#8712;</m:mo>
                                          <m:mi>C</m:mi>
                                       </m:mrow>
                                    </m:munder>
                                    <m:mrow>
                                       <m:mi>f</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>o</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:msub>
                                          <m:mrow>
                                             <m:mo stretchy="false">[</m:mo>
                                             <m:msup>
                                                <m:mi>e</m:mi>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>d</m:mi>
                                                      <m:mrow>
                                                         <m:mi>O</m:mi>
                                                         <m:mi>X</m:mi>
                                                      </m:mrow>
                                                   </m:msub>
                                                   <m:mi>Q</m:mi>
                                                </m:mrow>
                                             </m:msup>
                                             <m:mo stretchy="false">]</m:mo>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mi>o</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mi>x</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:msub>
                                          <m:mrow>
                                             <m:mo stretchy="false">[</m:mo>
                                             <m:msup>
                                                <m:mi>e</m:mi>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>d</m:mi>
                                                      <m:mrow>
                                                         <m:mi>O</m:mi>
                                                         <m:mi>Y</m:mi>
                                                      </m:mrow>
                                                   </m:msub>
                                                   <m:mi>Q</m:mi>
                                                </m:mrow>
                                             </m:msup>
                                             <m:mo stretchy="false">]</m:mo>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mi>o</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mi>y</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:msub>
                                          <m:mrow>
                                             <m:mo stretchy="false">[</m:mo>
                                             <m:msup>
                                                <m:mi>e</m:mi>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>d</m:mi>
                                                      <m:mrow>
                                                         <m:mi>O</m:mi>
                                                         <m:mi>Z</m:mi>
                                                      </m:mrow>
                                                   </m:msub>
                                                   <m:mi>Q</m:mi>
                                                </m:mrow>
                                             </m:msup>
                                             <m:mo stretchy="false">]</m:mo>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mi>o</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mi>z</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                           </m:mstyle>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGmbatcqGGOaakcqWGnbqtcqWGtbWucqWGbbqqcqGG8baFcqWGKbazdaWgaaWcbaGaem4ta8KaemiwaGfabeaakiabcYcaSiabdsgaKnaaBaaaleaacqWGpbWtcqWGzbqwaeqaaOGaeiilaWIaemizaq2aaSbaaSqaaiabd+eapjabdQfaAbqabaGccqGGPaqkcqGH9aqpdaqeqbqaamaaqafabaGaemOzayMaeiikaGIaem4Ba8MaeiykaKIaei4waSLaemyzau2aaWbaaSqabeaacqWGKbazdaWgaaadbaGaem4ta8KaemiwaGfabeaaliabdgfarbaakiabc2faDnaaBaaaleaacqWGVbWBcqGGSaalcqWG4baEaeqaaOGaei4waSLaemyzau2aaWbaaSqabeaacqWGKbazdaWgaaadbaGaem4ta8KaemywaKfabeaaliabdgfarbaakiabc2faDnaaBaaaleaacqWGVbWBcqGGSaalcqWG5bqEaeqaaOGaei4waSLaemyzau2aaWbaaSqabeaacqWGKbazdaWgaaadbaGaem4ta8KaemOwaOfabeaaliabdgfarbaakiabc2faDnaaBaaaleaacqWGVbWBcqGGSaalcqWG6bGEaeqaaaqaaiabd+gaVjabgIGiolabdoeadbqab0GaeyyeIuoaaSqaaiabcUfaBjabdIha4jabcYcaSiabdMha5jabcYcaSiabdQha6jabc2faDbqab0Gaey4dIunaaaa@7F5E@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
               <p>where <it>C </it>is the set of characters &#8211; the 20 amino-acids in the present case, and <it>f</it>(<it>o</it>) the background frequency of the character <it>o</it>. Consequently, the log-likelihood function <it>l </it>is</p>
               <p>
                  <m:math name="1471-2105-7-529-i17" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>l</m:mi>
                           <m:mo stretchy="false">(</m:mo>
                           <m:mi>M</m:mi>
                           <m:mi>S</m:mi>
                           <m:mi>A</m:mi>
                           <m:mo>|</m:mo>
                           <m:msub>
                              <m:mi>d</m:mi>
                              <m:mrow>
                                 <m:mi>O</m:mi>
                                 <m:mi>X</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>,</m:mo>
                           <m:msub>
                              <m:mi>d</m:mi>
                              <m:mrow>
                                 <m:mi>O</m:mi>
                                 <m:mi>Y</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>,</m:mo>
                           <m:msub>
                              <m:mi>d</m:mi>
                              <m:mrow>
                                 <m:mi>O</m:mi>
                                 <m:mi>Z</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo stretchy="false">)</m:mo>
                           <m:mo>=</m:mo>
                           <m:mstyle displaystyle="true">
                              <m:munder>
                                 <m:mo>&#8721;</m:mo>
                                 <m:mrow>
                                    <m:mo stretchy="false">[</m:mo>
                                    <m:mi>x</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:mi>y</m:mi>
                                    <m:mo>,</m:mo>
                                    <m:mi>z</m:mi>
                                    <m:mo stretchy="false">]</m:mo>
                                 </m:mrow>
                              </m:munder>
                              <m:mrow>
                                 <m:mi>log</m:mi>
                                 <m:mo>&#8289;</m:mo>
                                 <m:mstyle displaystyle="true">
                                    <m:munder>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>o</m:mi>
                                          <m:mo>&#8712;</m:mo>
                                          <m:mi>C</m:mi>
                                       </m:mrow>
                                    </m:munder>
                                    <m:mrow>
                                       <m:mi>f</m:mi>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mi>o</m:mi>
                                       <m:mo stretchy="false">)</m:mo>
                                       <m:msub>
                                          <m:mrow>
                                             <m:mo stretchy="false">[</m:mo>
                                             <m:msup>
                                                <m:mi>e</m:mi>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>d</m:mi>
                                                      <m:mrow>
                                                         <m:mi>O</m:mi>
                                                         <m:mi>X</m:mi>
                                                      </m:mrow>
                                                   </m:msub>
                                                   <m:mi>Q</m:mi>
                                                </m:mrow>
                                             </m:msup>
                                             <m:mo stretchy="false">]</m:mo>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mi>o</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mi>x</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:msub>
                                          <m:mrow>
                                             <m:mo stretchy="false">[</m:mo>
                                             <m:msup>
                                                <m:mi>e</m:mi>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>d</m:mi>
                                                      <m:mrow>
                                                         <m:mi>O</m:mi>
                                                         <m:mi>Y</m:mi>
                                                      </m:mrow>
                                                   </m:msub>
                                                   <m:mi>Q</m:mi>
                                                </m:mrow>
                                             </m:msup>
                                             <m:mo stretchy="false">]</m:mo>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mi>o</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mi>y</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                       <m:msub>
                                          <m:mrow>
                                             <m:mo stretchy="false">[</m:mo>
                                             <m:msup>
                                                <m:mi>e</m:mi>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>d</m:mi>
                                                      <m:mrow>
                                                         <m:mi>O</m:mi>
                                                         <m:mi>Z</m:mi>
                                                      </m:mrow>
                                                   </m:msub>
                                                   <m:mi>Q</m:mi>
                                                </m:mrow>
                                             </m:msup>
                                             <m:mo stretchy="false">]</m:mo>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mi>o</m:mi>
                                             <m:mo>,</m:mo>
                                             <m:mi>z</m:mi>
                                          </m:mrow>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                           </m:mstyle>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGSbaBcqGGOaakcqWGnbqtcqWGtbWucqWGbbqqcqGG8baFcqWGKbazdaWgaaWcbaGaem4ta8KaemiwaGfabeaakiabcYcaSiabdsgaKnaaBaaaleaacqWGpbWtcqWGzbqwaeqaaOGaeiilaWIaemizaq2aaSbaaSqaaiabd+eapjabdQfaAbqabaGccqGGPaqkcqGH9aqpdaaeqbqaaiGbcYgaSjabc+gaVjabcEgaNnaaqafabaGaemOzayMaeiikaGIaem4Ba8MaeiykaKIaei4waSLaemyzau2aaWbaaSqabeaacqWGKbazdaWgaaadbaGaem4ta8KaemiwaGfabeaaliabdgfarbaakiabc2faDnaaBaaaleaacqWGVbWBcqGGSaalcqWG4baEaeqaaOGaei4waSLaemyzau2aaWbaaSqabeaacqWGKbazdaWgaaadbaGaem4ta8KaemywaKfabeaaliabdgfarbaakiabc2faDnaaBaaaleaacqWGVbWBcqGGSaalcqWG5bqEaeqaaOGaei4waSLaemyzau2aaWbaaSqabeaacqWGKbazdaWgaaadbaGaem4ta8KaemOwaOfabeaaliabdgfarbaakiabc2faDnaaBaaaleaacqWGVbWBcqGGSaalcqWG6bGEaeqaaaqaaiabd+gaVjabgIGiolabdoeadbqab0GaeyyeIuoaaSqaaiabcUfaBjabdIha4jabcYcaSiabdMha5jabcYcaSiabdQha6jabc2faDbqab0GaeyyeIuoaaaa@83CD@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
               <p>The log-likelihood is maximum where its gradient disappears:</p>
               <p>
                  <m:math name="1471-2105-7-529-i18" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mo>&#8711;</m:mo>
                           <m:mi>l</m:mi>
                           <m:mo>=</m:mo>
                           <m:mrow>
                              <m:mo>(</m:mo>
                              <m:mrow>
                                 <m:mtable>
                                    <m:mtr>
                                       <m:mtd>
                                          <m:mrow>
                                             <m:mo>&#8706;</m:mo>
                                             <m:mi>l</m:mi>
                                             <m:mo>/</m:mo>
                                             <m:mo>&#8706;</m:mo>
                                             <m:msub>
                                                <m:mi>d</m:mi>
                                                <m:mrow>
                                                   <m:mi>O</m:mi>
                                                   <m:mi>X</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr>
                                       <m:mtd>
                                          <m:mrow>
                                             <m:mo>&#8706;</m:mo>
                                             <m:mi>l</m:mi>
                                             <m:mo>/</m:mo>
                                             <m:mo>&#8706;</m:mo>
                                             <m:msub>
                                                <m:mi>d</m:mi>
                                                <m:mrow>
                                                   <m:mi>O</m:mi>
                                                   <m:mi>Y</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr>
                                       <m:mtd>
                                          <m:mrow>
                                             <m:mo>&#8706;</m:mo>
                                             <m:mi>l</m:mi>
                                             <m:mo>/</m:mo>
                                             <m:mo>&#8706;</m:mo>
                                             <m:msub>
                                                <m:mi>d</m:mi>
                                                <m:mrow>
                                                   <m:mi>O</m:mi>
                                                   <m:mi>Z</m:mi>
                                                </m:mrow>
                                             </m:msub>
                                          </m:mrow>
                                       </m:mtd>
                                    </m:mtr>
                                 </m:mtable>
                              </m:mrow>
                              <m:mo>)</m:mo>
                           </m:mrow>
                           <m:mo>=</m:mo>
                           <m:mrow>
                              <m:mo>(</m:mo>
                              <m:mrow>
                                 <m:mtable>
                                    <m:mtr>
                                       <m:mtd>
                                          <m:mn>0</m:mn>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr>
                                       <m:mtd>
                                          <m:mn>0</m:mn>
                                       </m:mtd>
                                    </m:mtr>
                                    <m:mtr>
                                       <m:mtd>
                                          <m:mn>0</m:mn>
                                       </m:mtd>
                                    </m:mtr>
                                 </m:mtable>
                              </m:mrow>
                              <m:mo>)</m:mo>
                           </m:mrow>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqGHhis0cqWGSbaBcqGH9aqpdaqadaqaauaabeqadeaaaeaacqGHciITcqWGSbaBcqGGVaWlcqGHciITcqWGKbazdaWgaaWcbaGaem4ta8KaemiwaGfabeaaaOqaaiabgkGi2kabdYgaSjabc+caViabgkGi2kabdsgaKnaaBaaaleaacqWGpbWtcqWGzbqwaeqaaaGcbaGaeyOaIyRaemiBaWMaei4la8IaeyOaIyRaemizaq2aaSbaaSqaaiabd+eapjabdQfaAbqabaaaaaGccaGLOaGaayzkaaGaeyypa0ZaaeWaaeaafaqabeWabaaabaGaeGimaadabaGaeGimaadabaGaeGimaadaaaGaayjkaiaawMcaaaaa@528D@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
               <p>There again, the problem can be solved efficiently by Newton's iteration</p>
               <p>
                  <m:math name="1471-2105-7-529-i19" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mrow>
                                 <m:mrow>
                                    <m:mo>(</m:mo>
                                    <m:mrow>
                                       <m:mtable>
                                          <m:mtr>
                                             <m:mtd>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mover accent="true">
                                                         <m:mi>d</m:mi>
                                                         <m:mo>^</m:mo>
                                                      </m:mover>
                                                      <m:mrow>
                                                         <m:mi>O</m:mi>
                                                         <m:mi>X</m:mi>
                                                      </m:mrow>
                                                   </m:msub>
                                                </m:mrow>
                                             </m:mtd>
                                          </m:mtr>
                                          <m:mtr>
                                             <m:mtd>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mover accent="true">
                                                         <m:mi>d</m:mi>
                                                         <m:mo>^</m:mo>
                                                      </m:mover>
                                                      <m:mrow>
                                                         <m:mi>O</m:mi>
                                                         <m:mi>Y</m:mi>
                                                      </m:mrow>
                                                   </m:msub>
                                                </m:mrow>
                                             </m:mtd>
                                          </m:mtr>
                                          <m:mtr>
                                             <m:mtd>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mover accent="true">
                                                         <m:mi>d</m:mi>
                                                         <m:mo>^</m:mo>
                                                      </m:mover>
                                                      <m:mrow>
                                                         <m:mi>O</m:mi>
                                                         <m:mi>Z</m:mi>
                                                      </m:mrow>
                                                   </m:msub>
                                                </m:mrow>
                                             </m:mtd>
                                          </m:mtr>
                                       </m:mtable>
                                    </m:mrow>
                                    <m:mo>)</m:mo>
                                 </m:mrow>
                              </m:mrow>
                              <m:mrow>
                                 <m:mi>i</m:mi>
                                 <m:mo>+</m:mo>
                                 <m:mn>1</m:mn>
                              </m:mrow>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:msub>
                              <m:mrow>
                                 <m:mrow>
                                    <m:mo>(</m:mo>
                                    <m:mrow>
                                       <m:mtable>
                                          <m:mtr>
                                             <m:mtd>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mover accent="true">
                                                         <m:mi>d</m:mi>
                                                         <m:mo>^</m:mo>
                                                      </m:mover>
                                                      <m:mrow>
                                                         <m:mi>O</m:mi>
                                                         <m:mi>X</m:mi>
                                                      </m:mrow>
                                                   </m:msub>
                                                </m:mrow>
                                             </m:mtd>
                                          </m:mtr>
                                          <m:mtr>
                                             <m:mtd>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mover accent="true">
                                                         <m:mi>d</m:mi>
                                                         <m:mo>^</m:mo>
                                                      </m:mover>
                                                      <m:mrow>
                                                         <m:mi>O</m:mi>
                                                         <m:mi>Y</m:mi>
                                                      </m:mrow>
                                                   </m:msub>
                                                </m:mrow>
                                             </m:mtd>
                                          </m:mtr>
                                          <m:mtr>
                                             <m:mtd>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mover accent="true">
                                                         <m:mi>d</m:mi>
                                                         <m:mo>^</m:mo>
                                                      </m:mover>
                                                      <m:mrow>
                                                         <m:mi>O</m:mi>
                                                         <m:mi>Z</m:mi>
                                                      </m:mrow>
                                                   </m:msub>
                                                </m:mrow>
                                             </m:mtd>
                                          </m:mtr>
                                       </m:mtable>
                                    </m:mrow>
                                    <m:mo>)</m:mo>
                                 </m:mrow>
                              </m:mrow>
                              <m:mi>i</m:mi>
                           </m:msub>
                           <m:mo>&#8722;</m:mo>
                           <m:msup>
                              <m:mrow>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:msup>
                                    <m:mo>&#8711;</m:mo>
                                    <m:mn>2</m:mn>
                                 </m:msup>
                                 <m:msub>
                                    <m:mi>l</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mn>1</m:mn>
                              </m:mrow>
                           </m:msup>
                           <m:mo>&#8711;</m:mo>
                           <m:msub>
                              <m:mi>l</m:mi>
                              <m:mi>i</m:mi>
                           </m:msub>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaqadaqaauaabeqadeaaaeaacuWGKbazgaqcamaaBaaaleaacqWGpbWtcqWGybawaeqaaaGcbaGafmizaqMbaKaadaWgaaWcbaGaem4ta8KaemywaKfabeaaaOqaaiqbdsgaKzaajaWaaSbaaSqaaiabd+eapjabdQfaAbqabaaaaaGccaGLOaGaayzkaaWaaSbaaSqaaiabdMgaPjabgUcaRiabigdaXaqabaGccqGH9aqpdaqadaqaauaabeqadeaaaeaacuWGKbazgaqcamaaBaaaleaacqWGpbWtcqWGybawaeqaaaGcbaGafmizaqMbaKaadaWgaaWcbaGaem4ta8KaemywaKfabeaaaOqaaiqbdsgaKzaajaWaaSbaaSqaaiabd+eapjabdQfaAbqabaaaaaGccaGLOaGaayzkaaWaaSbaaSqaaiabdMgaPbqabaGccqGHsislcqGGOaakcqGHhis0daahaaWcbeqaaiabikdaYaaakiabdYgaSnaaBaaaleaacqWGPbqAaeqaaOGaeiykaKYaaWbaaSqabeaacqGHsislcqaIXaqmaaGccqGHhis0cqWGSbaBdaWgaaWcbaGaemyAaKgabeaaaaa@5C6E@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
               <p>where (&#8711;<sup>2</sup><it>l</it>)<sup>-1 </sup>is the inverse of the Hessian (derivable in the same fashion as the gradient, not shown here). The inverse of the Hessian also yields the variance-covariance matrix of the estimates <m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>OX</it></sub>, <m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>OY</it></sub>, <m:math name="1471-2105-7-529-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>d</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuWGKbazgaqcaaaa@2E0D@</m:annotation></m:semantics></m:math><sub><it>OZ </it></sub>when multiplied by -1. A final use of the Hessian is to check that its complement is positive definite, a condition necessary to ensure that the solution found is indeed a maximum and not a minimum or a saddle point. Therefore, we obtain the variance of <m:math name="1471-2105-7-529-i1" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mover accent="true"><m:mi>&#916;</m:mi><m:mo>^</m:mo></m:mover><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacuqHuoargaqcaaaa@2E22@</m:annotation></m:semantics></m:math><sub><it>triplet </it></sub>from the variance-covariance matrix:</p>
               <p>
                  <m:math name="1471-2105-7-529-i20" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mtable>
                           <m:mtr>
                              <m:mtd>
                                 <m:msup>
                                    <m:mi>&#963;</m:mi>
                                    <m:mn>2</m:mn>
                                 </m:msup>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:msub>
                                    <m:mover accent="true">
                                       <m:mi>&#916;</m:mi>
                                       <m:mo>^</m:mo>
                                    </m:mover>
                                    <m:mrow>
                                       <m:mi>t</m:mi>
                                       <m:mi>r</m:mi>
                                       <m:mi>i</m:mi>
                                       <m:mi>p</m:mi>
                                       <m:mi>l</m:mi>
                                       <m:mi>e</m:mi>
                                       <m:mi>t</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>=</m:mo>
                                 <m:msup>
                                    <m:mi>&#963;</m:mi>
                                    <m:mn>2</m:mn>
                                 </m:msup>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:msub>
                                    <m:mover accent="true">
                                       <m:mi>d</m:mi>
                                       <m:mo>^</m:mo>
                                    </m:mover>
                                    <m:mrow>
                                       <m:mi>O</m:mi>
                                       <m:mi>Y</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>+</m:mo>
                                 <m:msup>
                                    <m:mi>&#963;</m:mi>
                                    <m:mn>2</m:mn>
                                 </m:msup>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:msub>
                                    <m:mover accent="true">
                                       <m:mi>d</m:mi>
                                       <m:mo>^</m:mo>
                                    </m:mover>
                                    <m:mrow>
                                       <m:mi>O</m:mi>
                                       <m:mi>Z</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mn>2</m:mn>
                                 <m:mi>c</m:mi>
                                 <m:mi>o</m:mi>
                                 <m:mi>v</m:mi>
                                 <m:mo stretchy="false">(</m:mo>
                                 <m:msub>
                                    <m:mover accent="true">
                                       <m:mi>d</m:mi>
                                       <m:mo>^</m:mo>
                                    </m:mover>
                                    <m:mrow>
                                       <m:mi>O</m:mi>
                                       <m:mi>Y</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo>,</m:mo>
                                 <m:msub>
                                    <m:mover accent="true">
                                       <m:mi>d</m:mi>
                                       <m:mo>^</m:mo>
                                    </m:mover>
                                    <m:mrow>
                                       <m:mi>O</m:mi>
                                       <m:mi>Z</m:mi>
                                    </m:mrow>
                                 </m:msub>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mtd>
                           </m:mtr>
                           <m:mtr>
                              <m:mtd>
                                 <m:mo>=</m:mo>
                                 <m:mo stretchy="false">[</m:mo>
                                 <m:mn>0</m:mn>
                                 <m:mo>,</m:mo>
                                 <m:mn>1</m:mn>
                                 <m:mo>,</m:mo>
                                 <m:mo>&#8722;</m:mo>
                                 <m:mn>1</m:mn>
                                 <m:mo stretchy="false">]</m:mo>
                                 <m:msup>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mo>&#8722;</m:mo>
                                    <m:msup>
                                       <m:mo>&#8711;</m:mo>
                                       <m:mn>2</m:mn>
                                    </m:msup>
                                    <m:mi>l</m:mi>
                                    <m:mo stretchy="false">)</m:mo>
                                    <m:mrow>
                                       <m:mo>&#8722;</m:mo>
                                       <m:mn>1</m:mn>
                                    </m:mrow>
                                 </m:msup>
                                 <m:mrow>
                                    <m:mo>[</m:mo>
                                    <m:mrow>
                                       <m:mtable>
                                          <m:mtr>
                                             <m:mtd>
                                                <m:mn>0</m:mn>
                                             </m:mtd>
                                          </m:mtr>
                                          <m:mtr>
                                             <m:mtd>
                                                <m:mn>1</m:mn>
                                             </m:mtd>
                                          </m:mtr>
                                          <m:mtr>
                                             <m:mtd>
                                                <m:mrow>
                                                   <m:mo>&#8722;</m:mo>
                                                   <m:mn>1</m:mn>
                                                </m:mrow>
                                             </m:mtd>
                                          </m:mtr>
                                       </m:mtable>
                                    </m:mrow>
                                    <m:mo>]</m:mo>
                                 </m:mrow>
                              </m:mtd>
                           </m:mtr>
                        </m:mtable>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakqaaeeqaaGGaciab=n8aZnaaCaaaleqabaGaeGOmaidaaOGaeiikaGIafuiLdqKbaKaadaWgaaWcbaGaemiDaqNaemOCaiNaemyAaKMaemiCaaNaemiBaWMaemyzauMaemiDaqhabeaakiabcMcaPiabg2da9iab=n8aZnaaCaaaleqabaGaeGOmaidaaOGaeiikaGIafmizaqMbaKaadaWgaaWcbaGaem4ta8KaemywaKfabeaakiabcMcaPiabgUcaRiab=n8aZnaaCaaaleqabaGaeGOmaidaaOGaeiikaGIafmizaqMbaKaadaWgaaWcbaGaem4ta8KaemOwaOfabeaakiabcMcaPiabgkHiTiabikdaYGqaciab+ngaJjab+9gaVjab+zha2jabcIcaOiqbdsgaKzaajaWaaSbaaSqaaiabd+eapjabdMfazbqabaGccqGGSaalcuWGKbazgaqcamaaBaaaleaacqWGpbWtcqWGAbGwaeqaaOGaeiykaKcabaGaeyypa0Jaei4waSLaeGimaaJaeiilaWIaeGymaeJaeiilaWIaeyOeI0IaeGymaeJaeiyxa0LaeiikaGIaeyOeI0Iaey4bIe9aaWbaaSqabeaacqaIYaGmaaGccqWGSbaBcqGGPaqkdaahaaWcbeqaaiabgkHiTiabigdaXaaakmaadmaabaqbaeqabmqaaaqaaiabicdaWaqaaiabigdaXaqaaiabgkHiTiabigdaXaaaaiaawUfacaGLDbaaaaaa@77E8@</m:annotation>
                     </m:semantics>
                  </m:math>
               </p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>CD was the main investigator and writer. MG contributed ideas, wrote part of the method section, and performed simulations. AS contributed the introduction to PAM distances, and the section on asymmetrical evolution. GG devised the numerical approximation and contributed the appendix. All authors read and approved the final manuscript.</p>
      </sec>
      <sec>
         <st>
            <p>Appendix</p>
         </st>
         <sec>
            <st>
               <p>Complexity of the analytical solution of <it>k</it>-states model for triplets</p>
            </st>
            <p>In the following, we show that the analytical solution of the maximum-likelihood estimator for the distances of a triplet is very complex, even for a simplified model of mutation. The <it>k</it>-state model <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> is an idealized situation where each position has <it>k </it>possible states and the transition probabilities are all identical and only depend on the time <it>t</it>. For <it>k </it>= 4 this is equivalent to the Jukes-Cantor model <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Whatever is the initial state, the probability of a mutation after time <it>t </it>is given by</p>
            <p>
               <m:math name="1471-2105-7-529-i21" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mi>p</m:mi>
                        <m:mo stretchy="false">(</m:mo>
                        <m:mi>t</m:mi>
                        <m:mo stretchy="false">)</m:mo>
                        <m:mo>=</m:mo>
                        <m:mfrac>
                           <m:mrow>
                              <m:mi>k</m:mi>
                              <m:mo>&#8722;</m:mo>
                              <m:mn>1</m:mn>
                           </m:mrow>
                           <m:mi>k</m:mi>
                        </m:mfrac>
                        <m:mo stretchy="false">(</m:mo>
                        <m:mn>1</m:mn>
                        <m:mo>&#8722;</m:mo>
                        <m:msup>
                           <m:mi>r</m:mi>
                           <m:mi>t</m:mi>
                        </m:msup>
                        <m:mo stretchy="false">)</m:mo>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGWbaCcqGGOaakcqWG0baDcqGGPaqkcqGH9aqpdaWcaaqaaiabdUgaRjabgkHiTiabigdaXaqaaiabdUgaRbaacqGGOaakcqaIXaqmcqGHsislcqWGYbGCdaahaaWcbeqaaiabdsha0baakiabcMcaPaaa@3D8D@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>where <it>r </it>is</p>
            <p>
               <m:math name="1471-2105-7-529-i22" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mi>r</m:mi>
                        <m:mo>=</m:mo>
                        <m:mn>1</m:mn>
                        <m:mo>&#8722;</m:mo>
                        <m:mfrac>
                           <m:mi>k</m:mi>
                           <m:mrow>
                              <m:mn>100</m:mn>
                              <m:mo stretchy="false">(</m:mo>
                              <m:mi>k</m:mi>
                              <m:mo>&#8722;</m:mo>
                              <m:mn>1</m:mn>
                              <m:mo stretchy="false">)</m:mo>
                           </m:mrow>
                        </m:mfrac>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaacqWGYbGCcqGH9aqpcqaIXaqmcqGHsisldaWcaaqaaiabdUgaRbqaaiabigdaXiabicdaWiabicdaWiabcIcaOiabdUgaRjabgkHiTiabigdaXiabcMcaPaaaaaa@3A25@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>so that <it>t </it>is measured in PAM units. (Measuring in PAM units is proportional to any other measure, and it means that at <it>t </it>= 1 one percent of the characters are changed, i.e. <it>p</it>(1) = 1/100.) and that all transitions are equally likely, and only depend on the PAM distance. Under this model, the log-likelihood can be expressed in terms of the counts of matches/mismatches of the triplet (<it>X</it>, <it>Y</it>, <it>Z</it>), i.e. <it>N</it><sub><it>xxx </it></sub>is the number of positions where all the characters are identical, <it>N</it><sub><it>xxz </it></sub>is the number of positions where <it>X </it>and <it>Y </it>coincide but <it>Z </it>differs, etc.</p>
            <p>
               <m:math name="1471-2105-7-529-i23" xmlns:m="http://www.w3.org/1998/Math/MathML">
                  <m:semantics>
                     <m:mrow>
                        <m:mtable>
                           <m:mtr>
                              <m:mtd>
                                 <m:mrow>
                                    <m:mi>l</m:mi>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mi>A</m:mi>
                                    <m:mo>|</m:mo>
                                    <m:mi>t</m:mi>
                                    <m:mo stretchy="false">)</m:mo>
                                    <m:mo>=</m:mo>
                                    <m:msub>
                                       <m:mi>N</m:mi>
                                       <m:mrow>
                                          <m:mi>x</m:mi>
                                          <m:mi>x</m:mi>
                                          <m:mi>x</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mi>log</m:mi>
                                    <m:mo>&#8289;</m:mo>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:msub>
                                       <m:mi>P</m:mi>
                                       <m:mrow>
                                          <m:mi>x</m:mi>
                                          <m:mi>x</m:mi>
                                          <m:mi>x</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                    <m:mo>+</m:mo>
                                    <m:msub>
                                       <m:mi>N</m:mi>
                                       <m:mrow>
                                          <m:mi>x</m:mi>
                                          <m:mi>x</m:mi>
                                          <m:mi>z</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mi>log</m:mi>
                                    <m:mo>&#8289;</m:mo>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:msub>
                                       <m:mi>P</m:mi>
                                       <m:mrow>
                                          <m:mi>x</m:mi>
                                          <m:mi>x</m:mi>
                                          <m:mi>z</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                    <m:mo>+</m:mo>
                                    <m:msub>
                                       <m:mi>N</m:mi>
                                       <m:mrow>
                                          <m:mi>x</m:mi>
                                          <m:mi>y</m:mi>
                                          <m:mi>x</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mi>log</m:mi>
                                    <m:mo>&#8289;</m:mo>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:msub>
                                       <m:mi>P</m:mi>
                                       <m:mrow>
                                          <m:mi>x</m:mi>
                                          <m:mi>y</m:mi>
                                          <m:mi>x</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                    <m:mo>+</m:mo>
                                 </m:mrow>
                              </m:mtd>
                           </m:mtr>
                           <m:mtr>
                              <m:mtd>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>N</m:mi>
                                       <m:mrow>
                                          <m:mi>x</m:mi>
                                          <m:mi>y</m:mi>
                                          <m:mi>y</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mi>log</m:mi>
                                    <m:mo>&#8289;</m:mo>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:msub>
                                       <m:mi>P</m:mi>
                                       <m:mrow>
                                          <m:mi>x</m:mi>
                                          <m:mi>y</m:mi>
                                          <m:mi>y</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                    <m:mo>+</m:mo>
                                    <m:msub>
                                       <m:mi>N</m:mi>
                                       <m:mrow>
                                          <m:mi>x</m:mi>
                                          <m:mi>y</m:mi>
                                          <m:mi>z</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mi>log</m:mi>
                                    <m:mo>&#8289;</m:mo>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:msub>
                                       <m:mi>P</m:mi>
                                       <m:mrow>
                                          <m:mi>x</m:mi>
                                          <m:mi>y</m:mi>
                                          <m:mi>z</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                 </m:mrow>
                              </m:mtd>
                           </m:mtr>
                           <m:mtr>
                              <m:mtd>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>P</m:mi>
                                       <m:mrow>
                                          <m:mi>x</m:mi>
                                          <m:mi>x</m:mi>
                                          <m:mi>x</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mo>=</m:mo>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mn>1</m:mn>
                                    <m:mo>&#8722;</m:mo>
                                    <m:msub>
                                       <m:mi>p</m:mi>
                                       <m:mi>x</m:mi>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mn>1</m:mn>
                                    <m:mo>&#8722;</m:mo>
                                    <m:msub>
                                       <m:mi>p</m:mi>
                                       <m:mi>y</m:mi>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mn>1</m:mn>
                                    <m:mo>&#8722;</m:mo>
                                    <m:msub>
                                       <m:mi>p</m:mi>
                                       <m:mi>z</m:mi>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                    <m:mo>+</m:mo>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>x</m:mi>
                                          </m:msub>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>y</m:mi>
                                          </m:msub>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>z</m:mi>
                                          </m:msub>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:msup>
                                             <m:mrow>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:mi>k</m:mi>
                                                <m:mo>&#8722;</m:mo>
                                                <m:mn>1</m:mn>
                                                <m:mo stretchy="false">)</m:mo>
                                             </m:mrow>
                                             <m:mn>2</m:mn>
                                          </m:msup>
                                       </m:mrow>
                                    </m:mfrac>
                                 </m:mrow>
                              </m:mtd>
                           </m:mtr>
                           <m:mtr>
                              <m:mtd>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>P</m:mi>
                                       <m:mrow>
                                          <m:mi>x</m:mi>
                                          <m:mi>x</m:mi>
                                          <m:mi>z</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mo>=</m:mo>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mn>1</m:mn>
                                    <m:mo>&#8722;</m:mo>
                                    <m:msub>
                                       <m:mi>p</m:mi>
                                       <m:mi>x</m:mi>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mn>1</m:mn>
                                    <m:mo>&#8722;</m:mo>
                                    <m:msub>
                                       <m:mi>p</m:mi>
                                       <m:mi>y</m:mi>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                    <m:msub>
                                       <m:mi>p</m:mi>
                                       <m:mi>z</m:mi>
                                    </m:msub>
                                    <m:mo>+</m:mo>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>x</m:mi>
                                          </m:msub>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>y</m:mi>
                                          </m:msub>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:mn>1</m:mn>
                                          <m:mo>&#8722;</m:mo>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>z</m:mi>
                                          </m:msub>
                                          <m:mo stretchy="false">)</m:mo>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:mi>k</m:mi>
                                          <m:mo>&#8722;</m:mo>
                                          <m:mn>1</m:mn>
                                       </m:mrow>
                                    </m:mfrac>
                                    <m:mo>+</m:mo>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:mi>k</m:mi>
                                          <m:mo>&#8722;</m:mo>
                                          <m:mn>2</m:mn>
                                          <m:mo stretchy="false">)</m:mo>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>x</m:mi>
                                          </m:msub>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>y</m:mi>
                                          </m:msub>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>z</m:mi>
                                          </m:msub>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:msup>
                                             <m:mrow>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:mi>k</m:mi>
                                                <m:mo>&#8722;</m:mo>
                                                <m:mn>1</m:mn>
                                                <m:mo stretchy="false">)</m:mo>
                                             </m:mrow>
                                             <m:mn>2</m:mn>
                                          </m:msup>
                                       </m:mrow>
                                    </m:mfrac>
                                 </m:mrow>
                              </m:mtd>
                           </m:mtr>
                           <m:mtr>
                              <m:mtd>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>P</m:mi>
                                       <m:mrow>
                                          <m:mi>x</m:mi>
                                          <m:mi>y</m:mi>
                                          <m:mi>x</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mo>=</m:mo>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mn>1</m:mn>
                                    <m:mo>&#8722;</m:mo>
                                    <m:msub>
                                       <m:mi>p</m:mi>
                                       <m:mi>x</m:mi>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                    <m:msub>
                                       <m:mi>p</m:mi>
                                       <m:mi>y</m:mi>
                                    </m:msub>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mn>1</m:mn>
                                    <m:mo>&#8722;</m:mo>
                                    <m:msub>
                                       <m:mi>p</m:mi>
                                       <m:mi>z</m:mi>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                    <m:mo>+</m:mo>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>x</m:mi>
                                          </m:msub>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:mn>1</m:mn>
                                          <m:mo>&#8722;</m:mo>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>y</m:mi>
                                          </m:msub>
                                          <m:mo stretchy="false">)</m:mo>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>z</m:mi>
                                          </m:msub>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:mi>k</m:mi>
                                          <m:mo>&#8722;</m:mo>
                                          <m:mn>1</m:mn>
                                       </m:mrow>
                                    </m:mfrac>
                                    <m:mo>+</m:mo>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:mi>k</m:mi>
                                          <m:mo>&#8722;</m:mo>
                                          <m:mn>2</m:mn>
                                          <m:mo stretchy="false">)</m:mo>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>x</m:mi>
                                          </m:msub>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>y</m:mi>
                                          </m:msub>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>z</m:mi>
                                          </m:msub>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:msup>
                                             <m:mrow>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:mi>k</m:mi>
                                                <m:mo>&#8722;</m:mo>
                                                <m:mn>1</m:mn>
                                                <m:mo stretchy="false">)</m:mo>
                                             </m:mrow>
                                             <m:mn>2</m:mn>
                                          </m:msup>
                                       </m:mrow>
                                    </m:mfrac>
                                 </m:mrow>
                              </m:mtd>
                           </m:mtr>
                           <m:mtr>
                              <m:mtd>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>P</m:mi>
                                       <m:mrow>
                                          <m:mi>x</m:mi>
                                          <m:mi>y</m:mi>
                                          <m:mi>y</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mo>=</m:mo>
                                    <m:msub>
                                       <m:mi>p</m:mi>
                                       <m:mi>x</m:mi>
                                    </m:msub>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mn>1</m:mn>
                                    <m:mo>&#8722;</m:mo>
                                    <m:msub>
                                       <m:mi>p</m:mi>
                                       <m:mi>y</m:mi>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                    <m:mo stretchy="false">(</m:mo>
                                    <m:mn>1</m:mn>
                                    <m:mo>&#8722;</m:mo>
                                    <m:msub>
                                       <m:mi>p</m:mi>
                                       <m:mi>z</m:mi>
                                    </m:msub>
                                    <m:mo stretchy="false">)</m:mo>
                                    <m:mo>+</m:mo>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:mn>1</m:mn>
                                          <m:mo>&#8722;</m:mo>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>x</m:mi>
                                          </m:msub>
                                          <m:mo stretchy="false">)</m:mo>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>y</m:mi>
                                          </m:msub>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>z</m:mi>
                                          </m:msub>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:mi>k</m:mi>
                                          <m:mo>&#8722;</m:mo>
                                          <m:mn>1</m:mn>
                                       </m:mrow>
                                    </m:mfrac>
                                    <m:mo>+</m:mo>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:mi>k</m:mi>
                                          <m:mo>&#8722;</m:mo>
                                          <m:mn>2</m:mn>
                                          <m:mo stretchy="false">)</m:mo>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>x</m:mi>
                                          </m:msub>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>y</m:mi>
                                          </m:msub>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>z</m:mi>
                                          </m:msub>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:msup>
                                             <m:mrow>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:mi>k</m:mi>
                                                <m:mo>&#8722;</m:mo>
                                                <m:mn>1</m:mn>
                                                <m:mo stretchy="false">)</m:mo>
                                             </m:mrow>
                                             <m:mn>2</m:mn>
                                          </m:msup>
                                       </m:mrow>
                                    </m:mfrac>
                                 </m:mrow>
                              </m:mtd>
                           </m:mtr>
                           <m:mtr>
                              <m:mtd>
                                 <m:mrow>
                                    <m:msub>
                                       <m:mi>P</m:mi>
                                       <m:mrow>
                                          <m:mi>x</m:mi>
                                          <m:mi>y</m:mi>
                                          <m:mi>z</m:mi>
                                       </m:mrow>
                                    </m:msub>
                                    <m:mo>=</m:mo>
                                    <m:mfrac>
                                       <m:mrow>
                                          <m:mi>k</m:mi>
                                          <m:mo>&#8722;</m:mo>
                                          <m:mn>2</m:mn>
                                       </m:mrow>
                                       <m:mrow>
                                          <m:mi>k</m:mi>
                                          <m:mo>&#8722;</m:mo>
                                          <m:mn>1</m:mn>
                                       </m:mrow>
                                    </m:mfrac>
                                    <m:mrow>
                                       <m:mo>(</m:mo>
                                       <m:mrow>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:mn>1</m:mn>
                                          <m:mo>&#8722;</m:mo>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>x</m:mi>
                                          </m:msub>
                                          <m:mo stretchy="false">)</m:mo>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>y</m:mi>
                                          </m:msub>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>z</m:mi>
                                          </m:msub>
                                          <m:mo>+</m:mo>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>x</m:mi>
                                          </m:msub>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:mn>1</m:mn>
                                          <m:mo>&#8722;</m:mo>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>y</m:mi>
                                          </m:msub>
                                          <m:mo stretchy="false">)</m:mo>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>z</m:mi>
                                          </m:msub>
                                          <m:mo>+</m:mo>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>x</m:mi>
                                          </m:msub>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>y</m:mi>
                                          </m:msub>
                                          <m:mo stretchy="false">(</m:mo>
                                          <m:mn>1</m:mn>
                                          <m:mo>&#8722;</m:mo>
                                          <m:msub>
                                             <m:mi>p</m:mi>
                                             <m:mi>z</m:mi>
                                          </m:msub>
                                          <m:mo stretchy="false">)</m:mo>
                                          <m:mo>+</m:mo>
                                          <m:mfrac>
                                             <m:mrow>
                                                <m:mo stretchy="false">(</m:mo>
                                                <m:mi>k</m:mi>
                                                <m:mo>&#8722;</m:mo>
                                                <m:mn>3</m:mn>
                                                <m:mo stretchy="false">)</m:mo>
                                                <m:msub>
                                                   <m:mi>p</m:mi>
                                                   <m:mi>x</m:mi>
                                                </m:msub>
                                                <m:msub>
                                                   <m:mi>p</m:mi>
                                                   <m:mi>y</m:mi>
                                                </m:msub>
                                                <m:msub>
                                                   <m:mi>p</m:mi>
                                                   <m:mi>z</m:mi>
                                                </m:msub>
                                             </m:mrow>
                                             <m:mrow>
                                                <m:mi>k</m:mi>
                                                <m:mo>&#8722;</m:mo>
                                                <m:mn>1</m:mn>
                                             </m:mrow>
                                          </m:mfrac>
                                       </m:mrow>
                                       <m:mo>)</m:mo>
                                    </m:mrow>
                                 </m:mrow>
                              </m:mtd>
                           </m:mtr>
                        </m:mtable>
                     </m:mrow>
                     <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaafaqabeWbbaaaaeaacqWGSbaBcqGGOaakcqWGbbqqcqGG8baFcqWG0baDcqGGPaqkcqGH9aqpcqWGobGtdaWgaaWcbaGaemiEaGNaemiEaGNaemiEaGhabeaakiGbcYgaSjabc+gaVjabcEgaNjabcIcaOiabdcfaqnaaBaaaleaacqWG4baEcqWG4baEcqWG4baEaeqaaOGaeiykaKIaey4kaSIaemOta40aaSbaaSqaaiabdIha4jabdIha4jabdQha6bqabaGccyGGSbaBcqGGVbWBcqGGNbWzcqGGOaakcqWGqbaudaWgaaWcbaGaemiEaGNaemiEaGNaemOEaOhabeaakiabcMcaPiabgUcaRiabd6eaonaaBaaaleaacqWG4baEcqWG5bqEcqWG4baEaeqaaOGagiiBaWMaei4Ba8Maei4zaCMaeiikaGIaemiuaa1aaSbaaSqaaiabdIha4jabdMha5jabdIha4bqabaGccqGGPaqkcqGHRaWkaeaacqWGobGtdaWgaaWcbaGaemiEaGNaemyEaKNaemyEaKhabeaakiGbcYgaSjabc+gaVjabcEgaNjabcIcaOiabdcfaqnaaBaaaleaacqWG4baEcqWG5bqEcqWG5bqEaeqaaOGaeiykaKIaey4kaSIaemOta40aaSbaaSqaaiabdIha4jabdMha5jabdQha6bqabaGccyGGSbaBcqGGVbWBcqGGNbWzcqGGOaakcqWGqbaudaWgaaWcbaGaemiEaGNaemyEaKNaemOEaOhabeaakiabcMcaPaqaaiabdcfaqnaaBaaaleaacqWG4baEcqWG4baEcqWG4baEaeqaaOGaeyypa0JaeiikaGIaeGymaeJaeyOeI0IaemiCaa3aaSbaaSqaaiabdIha4bqabaGccqGGPaqkcqGGOaakcqaIXaqmcqGHsislcqWGWbaCdaWgaaWcbaGaemyEaKhabeaakiabcMcaPiabcIcaOiabigdaXiabgkHiTiabdchaWnaaBaaaleaacqWG6bGEaeqaaOGaeiykaKIaey4kaSYaaSaaaeaacqWGWbaCdaWgaaWcbaGaemiEaGhabeaakiabdchaWnaaBaaaleaacqWG5bqEaeqaaOGaemiCaa3aaSbaaSqaaiabdQha6bqabaaakeaacqGGOaakcqWGRbWAcqGHsislcqaIXaqmcqGGPaqkdaahaaWcbeqaaiabikdaYaaaaaaakeaacqWGqbaudaWgaaWcbaGaemiEaGNaemiEaGNaemOEaOhabeaakiabg2da9iabcIcaOiabigdaXiabgkHiTiabdchaWnaaBaaaleaacqWG4baEaeqaaOGaeiykaKIaeiikaGIaeGymaeJaeyOeI0IaemiCaa3aaSbaaSqaaiabdMha5bqabaGccqGGPaqkcqWGWbaCdaWgaaWcbaGaemOEaOhabeaakiabgUcaRmaalaaabaGaemiCaa3aaSbaaSqaaiabdIha4bqabaGccqWGWbaCdaWgaaWcbaGaemyEaKhabeaakiabcIcaOiabigdaXiabgkHiTiabdchaWnaaBaaaleaacqWG6bGEaeqaaOGaeiykaKcabaGaem4AaSMaeyOeI0IaeGymaedaaiabgUcaRmaalaaabaGaeiikaGIaem4AaSMaeyOeI0IaeGOmaiJaeiykaKIaemiCaa3aaSbaaSqaaiabdIha4bqabaGccqWGWbaCdaWgaaWcbaGaemyEaKhabeaakiabdchaWnaaBaaaleaacqWG6bGEaeqaaaGcbaGaeiikaGIaem4AaSMaeyOeI0IaeGymaeJaeiykaKYaaWbaaSqabeaacqaIYaGmaaaaaaGcbaGaemiuaa1aaSbaaSqaaiabdIha4jabdMha5jabdIha4bqabaGccqGH9aqpcqGGOaakcqaIXaqmcqGHsislcqWGWbaCdaWgaaWcbaGaemiEaGhabeaakiabcMcaPiabdchaWnaaBaaaleaacqWG5bqEaeqaaOGaeiikaGIaeGymaeJaeyOeI0IaemiCaa3aaSbaaSqaaiabdQha6bqabaGccqGGPaqkcqGHRaWkdaWcaaqaaiabdchaWnaaBaaaleaacqWG4baEaeqaaOGaeiikaGIaeGymaeJaeyOeI0IaemiCaa3aaSbaaSqaaiabdMha5bqabaGccqGGPaqkcqWGWbaCdaWgaaWcbaGaemOEaOhabeaaaOqaaiabdUgaRjabgkHiTiabigdaXaaacqGHRaWkdaWcaaqaaiabcIcaOiabdUgaRjabgkHiTiabikdaYiabcMcaPiabdchaWnaaBaaaleaacqWG4baEaeqaaOGaemiCaa3aaSbaaSqaaiabdMha5bqabaGccqWGWbaCdaWgaaWcbaGaemOEaOhabeaaaOqaaiabcIcaOiabdUgaRjabgkHiTiabigdaXiabcMcaPmaaCaaaleqabaGaeGOmaidaaaaaaOqaaiabdcfaqnaaBaaaleaacqWG4baEcqWG5bqEcqWG5bqEaeqaaOGaeyypa0JaemiCaa3aaSbaaSqaaiabdIha4bqabaGccqGGOaakcqaIXaqmcqGHsislcqWGWbaCdaWgaaWcbaGaemyEaKhabeaakiabcMcaPiabcIcaOiabigdaXiabgkHiTiabdchaWnaaBaaaleaacqWG6bGEaeqaaOGaeiykaKIaey4kaSYaaSaaaeaacqGGOaakcqaIXaqmcqGHsislcqWGWbaCdaWgaaWcbaGaemiEaGhabeaakiabcMcaPiabdchaWnaaBaaaleaacqWG5bqEaeqaaOGaemiCaa3aaSbaaSqaaiabdQha6bqabaaakeaacqWGRbWAcqGHsislcqaIXaqmaaGaey4kaSYaaSaaaeaacqGGOaakcqWGRbWAcqGHsislcqaIYaGmcqGGPaqkcqWGWbaCdaWgaaWcbaGaemiEaGhabeaakiabdchaWnaaBaaaleaacqWG5bqEaeqaaOGaemiCaa3aaSbaaSqaaiabdQha6bqabaaakeaacqGGOaakcqWGRbWAcqGHsislcqaIXaqmcqGGPaqkdaahaaWcbeqaaiabikdaYaaaaaaakeaacqWGqbaudaWgaaWcbaGaemiEaGNaemyEaKNaemOEaOhabeaakiabg2da9maalaaabaGaem4AaSMaeyOeI0IaeGOmaidabaGaem4AaSMaeyOeI0IaeGymaedaamaabmaabaGaeiikaGIaeGymaeJaeyOeI0IaemiCaa3aaSbaaSqaaiabdIha4bqabaGccqGGPaqkcqWGWbaCdaWgaaWcbaGaemyEaKhabeaakiabdchaWnaaBaaaleaacqWG6bGEaeqaaOGaey4kaSIaemiCaa3aaSbaaSqaaiabdIha4bqabaGccqGGOaakcqaIXaqmcqGHsislcqWGWbaCdaWgaaWcbaGaemyEaKhabeaakiabcMcaPiabdchaWnaaBaaaleaacqWG6bGEaeqaaOGaey4kaSIaemiCaa3aaSbaaSqaaiabdIha4bqabaGccqWGWbaCdaWgaaWcbaGaemyEaKhabeaakiabcIcaOiabigdaXiabgkHiTiabdchaWnaaBaaaleaacqWG6bGEaeqaaOGaeiykaKIaey4kaSYaaSaaaeaacqGGOaakcqWGRbWAcqGHsislcqaIZaWmcqGGPaqkcqWGWbaCdaWgaaWcbaGaemiEaGhabeaakiabdchaWnaaBaaaleaacqWG5bqEaeqaaOGaemiCaa3aaSbaaSqaaiabdQha6bqabaaakeaacqWGRbWAcqGHsislcqaIXaqmaaaacaGLOaGaayzkaaaaaaaa@BCE8@</m:annotation>
                  </m:semantics>
               </m:math>
            </p>
            <p>where <it>p</it><sub><it>x </it></sub>is the probability of mutating from the origin to <it>X </it>and similarly for <it>p</it><sub><it>y </it></sub>and <it>p</it><sub><it>z</it></sub>. Taking partial derivatives of the likelihood with respect to <it>p</it><sub><it>x</it></sub>, <it>p</it><sub><it>y </it></sub>and <it>p</it><sub><it>z </it></sub>gives a system of 3 rational polynomial equations (all the logarithms disappear) in 3 unknowns and 6 parameters. Such a system of equations has a solution that will be an algebraic function of the parameters (a root of a polynomial, where the coefficients of the polynomial involve the parameters). Despite its simple appearance, this system of equations is beyond the capabilities of current computer algebra systems to resolve. And this is not a complete surprise, as the algebraic numbers/functions involved are at least of degree 23. The special case where two of the branches have the same length, has been solved exactly in <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>, they find that their solution is an algebraic function of degree 11. This unfortunately is not applicable as we are interested in the cases where the branches away from the origin are of different lengths.</p>
            <p>We have computed the exact solution for concrete values of the parameters, in particular <it>N</it><sub><it>xxx </it></sub>= 10, <it>N</it><sub><it>xxz </it></sub>= 5, <it>N</it><sub><it>xyx </it></sub>= 4, <it>N</it><sub><it>xyy </it></sub>= 3, <it>N</it><sub><it>xyz </it></sub>= 2, <it>k </it>= 3 using Maple and the value of <it>p</it><sub><it>x </it></sub>is a root of the irreducible polynomial</p>
            <p>-6582435840000 + 189590785228800 <it>z </it>- 2438333515038720 <it>z</it><sup>2 </sup>+ ...</p>
            <p>... + 10304020514917800 <it>z</it><sup>21 </sup>- 1635488137841976 <it>z</it><sup>22 </sup>+ 99990709180560 <it>z</it><sup>23</sup></p>
            <p>This means that the general solution will be an algebraic function of degree 23 or higher, it cannot be lower. If an instantiation of the polynomial with values gives this irreducible polynomial, then the general polynomial must be irreducible of degree 23 or higher (some terms could have simplified in the instantiation). This makes the usefulness of an exact solution inexistent. it is more difficult to solve the polynomial and select the right root than to maximize the likelihood and/or solve the system of equations by numerical methods.</p>
         </sec>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors thank Dan Graur and two anonymous reviewers for helpful comments and ideas.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <aug>
               <au>
                  <snm>Swofford</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Olsen</snm>
                  <fnm>GL</fnm>
               </au>
               <au>
                  <snm>Waddell</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Hillis</snm>
                  <fnm>DM</fnm>
               </au>
            </aug>
            <source>Phylogenetic inference</source>
            <publisher>Sunderland, Massachusetts: Sinauer Associates</publisher>
            <edition>2</edition>
            <pubdate>1996</pubdate>
            <fpage>407</fpage>
            <lpage>514</lpage>
         </bibl>
         <bibl id="B2">
            <title>
               <p>PHYLIP (Phylogeny Inference Package) version 3.6. Distributed by the author</p>
            </title>
            <aug>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <publisher>University of Washington. Seattle., Department of Genome Sciences</publisher>
            <pubdate>2004</pubdate>
         </bibl>
         <bibl id="B3">
            <title>
               <p>The KEGG resource for deciphering the genome</p>
            </title>
            <aug>
               <au>
                  <snm>Kanehisa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Goto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kawashima</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Okuno</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <issue>32 Database</issue>
            <fpage>277</fpage>
            <lpage>280</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/nar/gkh063</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>OMA, A Comprehensive, Automated Project for the Identification of Orthologs from Complete Genome Data: Introduction and First Achievements</p>
            </title>
            <aug>
               <au>
                  <snm>Dessimoz</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Cannarozzi</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gil</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Margadant</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Roth</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schneider</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gonnet</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>RECOMB 2005 Workshop on Comparative Genomics, Volume LNBI 3678 of Lecture Notes in Bioinformatics</source>
            <publisher>Springer-Verlag</publisher>
            <editor>McLysath A, Huson DH</editor>
            <pubdate>2005</pubdate>
            <fpage>61</fpage>
            <lpage>72</lpage>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Roundup: a multi-genome repository of orthologs and evolutionary distances</p>
            </title>
            <aug>
               <au>
                  <snm>DeLuca</snm>
                  <fnm>TF</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>IH</fnm>
               </au>
               <au>
                  <snm>Pu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Monaghan</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Peshkin</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Singh</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wall</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <issue>16</issue>
            <fpage>2044</fpage>
            <lpage>2046</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btl286</pubid>
                  <pubid idtype="pmpid" link="fulltext">16777906</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Evolution of protein molecules</p>
            </title>
            <aug>
               <au>
                  <snm>Jukes</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Cantor</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Mammalian protein metabolism III</source>
            <publisher>New York: Academic Press</publisher>
            <editor>Munro H</editor>
            <pubdate>1969</pubdate>
            <fpage>21</fpage>
            <lpage>132</lpage>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Dating of the human-ape splitting by a molecular clock of mitochondrial DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Hasegawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kishino</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yano</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1985</pubdate>
            <volume>22</volume>
            <fpage>160</fpage>
            <lpage>174</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF02101694</pubid>
                  <pubid idtype="pmpid">3934395</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>A model for evolutionary change in proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Dayhoff</snm>
                  <fnm>MO</fnm>
               </au>
               <au>
                  <snm>Schwartz</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Orcutt</snm>
                  <fnm>BC</fnm>
               </au>
            </aug>
            <source>Atlas of Protein Sequence and Structure</source>
            <publisher>National Biomedical Research Foundation</publisher>
            <editor>Dayhoff MO</editor>
            <pubdate>1978</pubdate>
            <volume>5</volume>
            <fpage>345</fpage>
            <lpage>352</lpage>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Exhaustive matching of the entire protein sequence database</p>
            </title>
            <aug>
               <au>
                  <snm>Gonnet</snm>
                  <fnm>GH</fnm>
               </au>
               <au>
                  <snm>Cohen</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Benner</snm>
                  <fnm>SA</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1992</pubdate>
            <volume>256</volume>
            <issue>5003</issue>
            <fpage>1443</fpage>
            <lpage>1445</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1604319</pubid>
                  <pubid idtype="pmpid" link="fulltext">1604319</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The Rapid Generation of Mutation Data Matrices from Protein Sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Jones</snm>
                  <fnm>DT</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Thornton</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Comput Applic Biosci</source>
            <pubdate>1992</pubdate>
            <volume>8</volume>
            <fpage>275</fpage>
            <lpage>282</lpage>
         </bibl>
         <bibl id="B11">
            <title>
               <p>A Codon-based Model of Nucleotide Substitution for Protein-coding DNA Sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Goldman</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1994</pubdate>
            <volume>11</volume>
            <issue>5</issue>
            <fpage>725</fpage>
            <lpage>736</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">7968486</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Empirical codon substitution matrix</p>
            </title>
            <aug>
               <au>
                  <snm>Schneider</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cannarozzi</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Gonnet</snm>
                  <fnm>GH</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <issue>134</issue>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1173088</pubid>
                  <pubid idtype="pmpid" link="fulltext">15927081</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Darwin v. 2.0: An Interpreted Computer Language for the Biosciences</p>
            </title>
            <aug>
               <au>
                  <snm>Gonnet</snm>
                  <fnm>GH</fnm>
               </au>
               <au>
                  <snm>Hallett</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Korostensky</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bernardin</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <issue>2</issue>
            <fpage>101</fpage>
            <lpage>103</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/16.2.101</pubid>
                  <pubid idtype="pmpid" link="fulltext">10842729</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Detecting Non-Orthology in the COG Database and Other Approaches Grouping Orthologs Using Genome-Specific Best Hits</p>
            </title>
            <aug>
               <au>
                  <snm>Dessimoz</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Boeckmann</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Roth</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gonnet</snm>
                  <fnm>GH</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <issue>11</issue>
            <fpage>3309</fpage>
            <lpage>3316</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1500873</pubid>
                  <pubid idtype="pmpid" link="fulltext">16835308</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl433</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <aug>
               <au>
                  <snm>Ohno</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Evolution by Gene Duplication</source>
            <publisher>Springer-Verlag, New York</publisher>
            <pubdate>1970</pubdate>
         </bibl>
         <bibl id="B16">
            <title>
               <p>The Ghost of Selection Past: Rates of Evolution and Functional Divergence of Anciently Duplicated Genes</p>
            </title>
            <aug>
               <au>
                  <snm>Van de Peer</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Braasch</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2001</pubdate>
            <volume>53</volume>
            <issue>4</issue>
            <fpage>436</fpage>
            <lpage>446</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s002390010233</pubid>
                  <pubid idtype="pmpid" link="fulltext">11675603</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Differential Selection After Duplication in Mammalian Developmental Genes</p>
            </title>
            <aug>
               <au>
                  <snm>Dermitzakis</snm>
                  <fnm>ET</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>AG</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2001</pubdate>
            <volume>18</volume>
            <fpage>557</fpage>
            <lpage>562</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11264407</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Phylogenetic analysis of vertebrate lactate dehydrogenase (LDH) multigene families</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>YJ</fnm>
               </au>
               <au>
                  <snm>Tsoi</snm>
                  <fnm>SCM</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2002</pubdate>
            <volume>54</volume>
            <issue>5</issue>
            <fpage>614</fpage>
            <lpage>24</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00239-001-0058-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">11965434</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Asymmetric Functional Divergence of Duplicate Genes in Yeast</p>
            </title>
            <aug>
               <au>
                  <snm>Wagner</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <fpage>1760</fpage>
            <lpage>1768</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12270902</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Very Low Power to Detect Asymmetric Divergence of Duplicated Genes</p>
            </title>
            <aug>
               <au>
                  <snm>Seoighe</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Scheffler</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>RECOMB 2005 Workshop on Comparative Genomics, Volume LNBI 3678 of Lecture Notes in Bioinformatics</source>
            <publisher>Springer-Verlag</publisher>
            <editor>McLysath A, Huson DH</editor>
            <pubdate>2005</pubdate>
            <fpage>142</fpage>
            <lpage>152</lpage>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Proof and evolutionary analysis of ancient genome duplication in the yeast Saccharomyces cerevisiae</p>
            </title>
            <aug>
               <au>
                  <snm>Kellis</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Birren</snm>
                  <fnm>BW</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>428</volume>
            <fpage>617</fpage>
            <lpage>624</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature02424</pubid>
                  <pubid idtype="pmpid" link="fulltext">15004568</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Extensive Duplication and Reshuffling in the Arabidopsis Genome</p>
            </title>
            <aug>
               <au>
                  <snm>Blanc</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Barakat</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Guyot</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Cooke</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Delseny</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2000</pubdate>
            <volume>12</volume>
            <fpage>1093</fpage>
            <lpage>1102</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">149051</pubid>
                  <pubid idtype="pmpid" link="fulltext">10899976</pubid>
                  <pubid idtype="doi">10.1105/tpc.12.7.1093</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Asymmetric Sequence Divergence of Duplicate Genes</p>
            </title>
            <aug>
               <au>
                  <snm>Conant</snm>
                  <fnm>GC</fnm>
               </au>
               <au>
                  <snm>Wagner</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>2052</fpage>
            <lpage>2058</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403682</pubid>
                  <pubid idtype="pmpid" link="fulltext">12952876</pubid>
                  <pubid idtype="doi">10.1101/gr.1252603</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The Closest BLAST Hit Is Often Not the Nearest Neighbor</p>
            </title>
            <aug>
               <au>
                  <snm>Koski</snm>
                  <fnm>LB</fnm>
               </au>
               <au>
                  <snm>Golding</snm>
                  <fnm>GB</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2001</pubdate>
            <volume>52</volume>
            <issue>6</issue>
            <fpage>540</fpage>
            <lpage>542</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11443357</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Evolutionary trees from gene frequencies and quantitative characters: finding maximum likelihood estimates</p>
            </title>
            <aug>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Evolution</source>
            <pubdate>1981</pubdate>
            <volume>35</volume>
            <fpage>1229</fpage>
            <lpage>1242</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/2408134</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>A Tutorial Introduction to Computational Biochemistry Using Darwin</p>
            </title>
            <aug>
               <au>
                  <snm>Gonnet</snm>
                  <fnm>GH</fnm>
               </au>
            </aug>
            <publisher>Tech. rep., Informatik, ETH Zurich, Switzerland</publisher>
            <pubdate>1994</pubdate>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Modeling amino acid replacement</p>
            </title>
            <aug>
               <au>
                  <snm>Muller</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Vingron</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Comput Biol</source>
            <pubdate>2000</pubdate>
            <volume>7</volume>
            <issue>6</issue>
            <fpage>761</fpage>
            <lpage>776</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1089/10665270050514918</pubid>
                  <pubid idtype="pmpid" link="fulltext">11382360</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Use of the method of generalized least-squares in reconstructing phylogenies from sequence data</p>
            </title>
            <aug>
               <au>
                  <snm>Bulmer</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1991</pubdate>
            <volume>8</volume>
            <issue>6</issue>
            <fpage>868</fpage>
            <lpage>883</lpage>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Idealized Mutational Clocks</p>
            </title>
            <aug>
               <au>
                  <snm>Cannarozzi</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Gonnet</snm>
                  <fnm>GH</fnm>
               </au>
            </aug>
            <publisher>Tech. rep., Informatik, ETH, Zurich</publisher>
            <pubdate>2005</pubdate>
            <url>http://www.biorecipes.com/IdealMut/code.html</url>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1173088</pubid>
                  <pubid idtype="pmpid" link="fulltext">15927081</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Maximum Likelihood Jukes-Cantor Triplets: Analytic Solutions</p>
            </title>
            <aug>
               <au>
                  <snm>Chor</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Hendy</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Snir</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2006</pubdate>
            <volume>23</volume>
            <issue>3</issue>
            <fpage>626</fpage>
            <lpage>632</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msj069</pubid>
                  <pubid idtype="pmpid" link="fulltext">16319091</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>

