<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2148-7-241</ui>
   <ji>1471-2148</ji>
   <fm>
      <dochead>Database</dochead>
      <bibl>
         <title>
            <p>OrthoMaM: A database of orthologous genomic markers for placental mammal phylogenetics</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Ranwez</snm>
               <fnm>Vincent</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>ranwez@isem.univ-montp2.fr</email>
            </au>
            <au id="A2">
               <snm>Delsuc</snm>
               <fnm>Fr&#233;d&#233;ric</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>delsuc@isem.univ-montp2.fr</email>
            </au>
            <au id="A3">
               <snm>Ranwez</snm>
               <fnm>Sylvie</fnm>
               <insr iid="I3"/>
               <email>sylvie.ranwez@ema.fr</email>
            </au>
            <au id="A4">
               <snm>Belkhir</snm>
               <fnm>Khalid</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>belkhir@univ-montp2.fr</email>
            </au>
            <au id="A5">
               <snm>Tilak</snm>
               <fnm>Marie-Ka</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>marika@isem.univ-montp2.fr</email>
            </au>
            <au id="A6">
               <snm>Douzery</snm>
               <mi>JP</mi>
               <fnm>Emmanuel</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>douzery@isem.univ-montp2.fr</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Universit&#233; Montpellier 2, CC064, Place Eug&#232;ne Bataillon, 34 095 Montpellier Cedex 05, France</p>
            </ins>
            <ins id="I2">
               <p>CNRS, Institut des Sciences de l'Evolution (UMR 5554), CC064, Place Eug&#232;ne Bataillon, 34 095 Montpellier Cedex 05, France</p>
            </ins>
            <ins id="I3">
               <p>Centre de Recherche LGI2P, Ecole des Mines d'Al&#232;s, Site EERIE Parc scientifique G. Besse, 30035 N&#238;mes Cedex 1, France</p>
            </ins>
         </insg>
         <source>BMC Evolutionary Biology</source>
         <issn>1471-2148</issn>
         <pubdate>2007</pubdate>
         <volume>7</volume>
         <issue>1</issue>
         <fpage>241</fpage>
         <url>http://www.biomedcentral.com/1471-2148/7/241</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18053139</pubid>
               <pubid idtype="doi">10.1186/1471-2148-7-241</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>13</day>
               <month>7</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>30</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>30</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Ranwez et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Molecular sequence data have become the standard in modern day phylogenetics. In particular, several long-standing questions of mammalian evolutionary history have been recently resolved thanks to the use of molecular characters. Yet, most studies have focused on only a handful of standard markers. The availability of an ever increasing number of whole genome sequences is a golden mine for modern systematics. Genomic data now provide the opportunity to select new markers that are potentially relevant for further resolving branches of the mammalian phylogenetic tree at various taxonomic levels.</p>
            </sec>
            <sec>
               <st>
                  <p>Description</p>
               </st>
               <p>The EnsEMBL database was used to determine a set of orthologous genes from 12 available complete mammalian genomes. As targets for possible amplification and sequencing in additional taxa, more than 3,000 exons of length > 400 bp have been selected, among which 118, 368, 608, and 674 are respectively retrieved for 12, 11, 10, and 9 species. A bioinformatic pipeline has been developed to provide evolutionary descriptors for these candidate markers in order to assess their potential phylogenetic utility. The resulting OrthoMaM (Orthologous Mammalian Markers) database can be queried and alignments can be downloaded through a dedicated web interface <url>http://kimura.univ-montp2.fr/orthomam</url>.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The importance of marker choice in phylogenetic studies has long been stressed. Our database centered on complete genome information now makes possible to select promising markers to a given phylogenetic question or a systematic framework by querying a number of evolutionary descriptors. The usefulness of the database is illustrated with two biological examples. First, two potentially useful markers were identified for rodent systematics based on relevant evolutionary parameters and sequenced in additional species. Second, a complete, gapless 94 kb supermatrix of 118 orthologous exons was assembled for 12 mammals. Phylogenetic analyses using probabilistic methods unambiguously supported the new placental phylogeny by retrieving the monophyly of Glires, Euarchontoglires, Laurasiatheria, and Boreoeutheria. Muroid rodents thus do not represent a basal placental lineage as it was mistakenly reasserted in some recent phylogenomic analyses based on fewer taxa. We expect the OrthoMaM database to be useful for further resolving the phylogenetic tree of placental mammals and for better understanding the evolutionary dynamics of their genomes, i.e., the forces that shaped coding sequences in terms of selective constraints.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Mammalian systematics has been a pioneering field in the use of molecular sequence data for inferring phylogenetic relationships. Molecular phylogenies at different levels of the mammalian evolutionary tree have accumulated since the seminal studies published in the early 1990s. Among the first genes to be used was the mitochondrial Cytochrome b (<it>MT-CYB</it>) <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> which since became the "barcode" marker for mammals with more than 20,000 sequences currently available representing about 2,000 species. The mitochondrial 12S rRNA gene has also early been considered but its use was limited by its less straightforward alignment <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Acknowledging the limits of single gene phylogenies, the potential of complete mitochondrial genomes for reconstructing placental orders relationships was early explored, but with a somewhat limited success as judged <it>a posteriori </it><abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. In parallel, the first efforts to identify single-copy orthologous nuclear markers for mammalian phylogenetics have been made from conserved, large-sized exons. The exon 1 of the Retinol Binding Protein 3 (<it>RBP3</it>) &#8211; also known as the Interphotoreceptor Retinoid-Binding Protein (<it>IRBP</it>) &#8211; was the first to be developed <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> later followed by the exon 28 of the von Willebrand Factor gene (<it>VWF</it>) <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Since then, other nuclear genes have acquired the status of "standard" mammalian phylogenetic markers. This is for example the case of the intronless Recombination Activating Gene 1 (<it>RAG1</it>) and &#945;-2B Adrenergic Receptor (<it>ADRA2B</it>) genes, the Growth Hormone Receptor (<it>GHR</it>) exon 10, the c-myc proto-oncogen (<it>MYC</it>) exon 3, and the Breast Cancer Associated protein 1 (<it>BRCA1</it>) exon 11. Either used in single gene phylogenies at their beginnings or in combination later <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>, this handful of markers has proven to be useful for unravelling unsuspected clades at different levels of the mammalian taxonomy, like for instance Afrotheria <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, Cetacea + Hippopotamidae <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, and the grouping of shrews and hedgehogs to the exclusion of moles within Eulipotyphla <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>.</p>
         <p>The choice of these useful markers has nevertheless been mainly empirical. In fact, their initial development has been almost entirely dependent upon the availability in public databases of human, murine, bovine, and canine sequences for allowing primer design. This historical constraint in marker choice involved that the phylogenetic informativeness of these genes is likely to be non optimal for many of the phylogenetic studies in which they have been used <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. Selecting the genes with the appropriate resolving power for a given phylogenetic problem is a difficult task, and theoretical work has so far provided only limited insight for guiding this choice <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. In practice, however, it has long been realized that there is an optimal evolutionary rate associated with a given phylogenetic question <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, and empirical procedures such as saturation plots <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> have been designed to evaluate the limits in resolving power of a given molecular marker.</p>
         <p>In mammals, the first attempt at specifically selecting multiple nuclear genes for tackling a circumscribed phylogenetic question was made by Murphy and co-workers who specifically targeted genes scattered throughout the mammalian genome to resolve the earliest placental divergences <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Their pragmatic approach was based on the initial selection of exons long enough for easy PCR amplification from whole genomic DNA (> 200 bp) and for which the nucleotide identity between human and mouse ranged between 80 and 95%. This simple procedure was successful at identifying a dozen of new phylogenetically informative nuclear markers for resolving the long-standing question of the evolutionary relationships among placental orders <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>.</p>
         <p>Mammalian phylogenetics is now turning into phylogenomics <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> with such large-scale sequencing initiatives as the ENCODE project <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp> and the NISC Comparative Vertebrate Sequencing Program <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. The availability of mammalian whole genome sequences provides a gold mine for the identification of new phylogenetic markers to further resolve the mammalian tree at different taxonomic levels. However, in this new genomic era, the main problem perhaps resides in the determination of orthology relationships among the different genomes. Bioinformatic tools have been developed for processing whole genome sequences such as INPARANOID <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> and OrthoMCL <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> resulting in dedicated databases of clusters of orthologous groups for eukaryotes <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>. A recent comparison of different orthology detection strategies has shown that phylogenetically based methods perform better than classical similarity search based methods <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Accordingly, the 2007 version of the EnsEMBL database now implements such a phylogenetically-based strategy using maximum likelihood and tree reconciliation methods for orthology assignment among vertebrate genomes <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>.</p>
         <p>In an effort to synthesize all these genomic information, we built upon the EnsEMBL database for constructing a mammalian centred database called OrthoMaM (Orthologous Mammalian Markers). Our aim is to provide a flexible resource for identifying new candidate markers for future use in mammalian phylogenetic studies. Similar approaches based on available genomic data have been recently conducted in plants <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> and ray-finned fishes <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> but they include their own determination of orthology in the corresponding bioinformatic pipelines. By directly relying on the EnsEMBL orthology assessment procedure, our approach has the advantage of being relatively easy to update as more mammalian complete genomes will become available and annotated.</p>
         <p>We focused on orthologous exons rather than on full-length transcripts in order to provide biologists with single continuous fragments potentially amplifiable from genomic DNA. Working with RNA extraction followed by RT-PCR would require a quality of tissue preservation that is not achieved in the vast majority of cases. Moreover, working with genomic DNA avoids the practical problems induced by potential differences of intron length among taxa during the PCR amplification, provided that exons are specifically targeted. We selected individual exons of more than 400 bp long. Increasing this arbitrary threshold might preclude the use of old tissue samples or museum specimens that often contain altered total DNA. Also, lowering this threshold length would involve keeping a total of 7,206 human, murine, and canine exons among which the shorter is only 84 bp long. The minimum length for an exon to be included in the database was thus set up to 400 bp because it offers a reasonable compromise between technical (PCR) constraints, the number of selected candidates, and subsequent sequencing efforts.</p>
         <p>Until now, the choice of phylogenetic markers for mammalian systematics has been governed more by historical constraints than by explicit criteria. This is the reason why we developed a bioinformatics pipeline to derive evolutionary descriptors related to the potential phylogenetic informativeness of each exon. Quantifying the substitution pattern of genes is important to understand the potential biases that might affect phylogenetic inferences <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. Ideally, a good marker would have an optimal evolutionary rate for the given phylogenetic question, equilibrated and homogeneous base frequencies <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>, and homogeneous distribution of site variability <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. Yet the characteristics of a valuable marker depends on the goals of the study it will be used for, and certainly also vary from one investigator to another. Therefore, rather than subjectively selecting a subset of these candidate exons, we provide evolutionary descriptors for all of them. The values of our evolutionary descriptors are indicated for each exon on individual web pages with links to EnsEMBL for full description of the loci. The corresponding sequence alignment and its associated maximum likelihood phylogenetic tree with model parameter estimates are also presented. Note that this phylogeny should be considered cautiously since it might not be optimal in terms of topology because of the use of a suboptimal model (see below), but it should nevertheless provide reliable estimates of model parameters <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. In any case, markers cannot be selected directly from the ML topology they have produced in order to avoid any potential misuse of the database biased by <it>a priori </it>phylogenetic beliefs.</p>
         <p>A number of these evolutionary characteristics can be queried directly through the web-interface. The value and reliability of some of these descriptors are strongly dependent with one another. For example, the global GC content is strongly related to the GC percentage at the third codon position. Moreover, the variance on model parameter estimates will be reduced with longer sequences. The OrthoMaM web-interface allows the user to take these interdependencies into account. On the one hand, one can impose sequences longer than 1000 bp when expecting precise estimates of model parameters. On the other hand, the sequence length may not really matter if the goal is to build a supermatrix with reduced GC bias by combining markers displaying roughly equilibrated base frequencies. Combined queries allow the easy retrieval of orthologous genes that are present in a given number of species for automating phylogenomic supermatrices assembly.</p>
      </sec>
      <sec>
         <st>
            <p>Construction and content</p>
         </st>
         <sec>
            <st>
               <p>Using the EnsEMBL orthology information</p>
            </st>
            <p>The gene annotation available through the EnsEMBL database includes orthology information <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. This annotation is based on phylogenetic analyses of clusters of homologous sequences corresponding to the longest transcript of each gene. Two homologous sequences can be annotated as being paralogue or orthologue depending on their relative position in the corresponding phylogeny. In this phylogeny, when two sequences from different taxa are closer to each other than to any other sequence of the corresponding taxa, they are said to be 1:1 orthologues. Such an orthology assessment is particularly interesting as it avoids markers having similar copies in the genome that can interfere during the amplification process. The quality of the EnsEMBL annotation is ensured by the analysis of a plethora of phylogenies of homologous sequences <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. This annotation requires computation facilities far beyond those available in most laboratories, but allows predicting orthology and paralogy relationships much more accurately than the classical reciprocal best hits approach <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. We therefore exploited this precious annotation for further analyses rather than trying to compete with it.</p>
            <p>The procedure used for detecting candidate markers for placental phylogenetics from whole genomes involves the following steps. First, we focused on human (<it>Homo sapiens</it>), mouse (<it>Mus musculus</it>) and dog (<it>Canis familiaris</it>), whose genomes have been fully sequenced with a high coverage, are well annotated, and are evolutionary divergent enough so that a gene shared by these three taxa is likely to be conserved among placental mammals. We selected genes that are predicted by EnsEMBL to be 1:1 orthologues between <it>Homo</it>:<it>Mus</it>, <it>Homo</it>:<it>Canis </it>and <it>Mus</it>:<it>Canis</it>. Second, for each such gene, we retrieved the longest transcript available for the <it>Homo sapiens </it>gene and for all its 1:1 orthologues among the 12 studied taxa (the latter three, plus <it>Pan troglodytes</it>, <it>Macaca mulatta</it>, <it>Rattus norvegicus</it>, <it>Oryctolagus cuniculus</it>, <it>Bos taurus</it>, <it>Dasypus novemcinctus</it>, <it>Loxodonta africana</it>, <it>Echinops telfairi</it>, and <it>Monodelphis domestica</it>). Then, we considered each exon of the human transcripts and searched for the corresponding orthologous exon in transcripts of other species. We assumed that the most similar exon of the orthologous transcript is actually the orthologous exon provided its sequence shared more than 50% of similarity with the human sequence. This similarity test just checks for the fact that there is an exon that matches with the one currently tested. More difficult problems about gene orthology, gene annotation, or pseudogene occurrence are already handled upstream by the EnsEMBL annotation system <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>.</p>
            <p>All orthologous exons that matched the previous criteria are stored in our database as a set of candidate phylogenetic markers. A total of 3,170 exons among 2,498 genes has been identified, among which the majority is available for 9 taxa (Figure <figr fid="F1">1</figr>). A subset of 118 exons is available for all 12 mammals, and, for fair comparisons among markers, will subsequently constitute the core dataset to illustrate the OrthoMaM properties. This figure is quite low as compared to the 3,170 initial markers available, but is easily explained by the low coverage of recently sequenced genomes. For example, 1,252 candidate markers were found for the 2X-coverage genome of <it>Dasypus novemcinctus</it>, versus 2,623 for the <it>Bos taurus </it>genome sequenced with a more comprehensive 6X-coverage.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Relation between the number of candidate exons and the number of mammalian taxa</p>
               </caption>
               <text>
                  <p><b>Relation between the number of candidate exons and the number of mammalian taxa</b>. Raw and cumulative numbers of exons as a function of the number of taxa for which they are retrieved are respectively given by the histogram and the red circles.</p>
               </text>
               <graphic file="1471-2148-7-241-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>A bioinformatic pipeline to describe the evolutionary dynamics of exons</p>
            </st>
            <p>The following section describes the tools used by our pipeline to estimate the descriptors of the molecular evolutionary properties of the markers. A suite of phylogenetic analyses was conducted to characterize each exonic marker. Since these analyses are time consuming, we relied on a Beowulf-class cluster supercomputer. Dedicated scripts have been developed to parallelize this step so that the database can be regularly updated. This was necessary due to the frequent updates of the EnsEMBL database. Moreover, our scripts are flexible enough to easily integrate new species when they become available as well as phylogenetic software updates. This ensures the upgradeability of our OrthoMaM database.</p>
            <p>First, the DNA sequences of exons were aligned with the help of amino acid translation using the transAlign software <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. Because some exon sequences were shorter than others at 5'- or 3'-extremities (for example because of either lower genomic coverage or too preliminary annotation), alignment extremities were trimmed for sites with missing nucleotides for at least half of the taxa. For a preliminary, fast screening of potential misaligned exons or divergent paralogues, a neighbor-joining (NJ, <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>) analysis on uncorrected pairwise distances was conducted with PAUP* version 4b10 <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. Alignments yielding a NJ tree with a total branch length (TBL) exceeding 2 substitutions per site were discarded.</p>
            <p>Second, base composition of the exons was characterized. For this purpose, we used the relative composition variability (RCV) <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> whose exact formula is:</p>
            <p>
               <display-formula>
                  <m:math name="1471-2148-7-241-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mi>R</m:mi>
                           <m:mi>C</m:mi>
                           <m:mi>V</m:mi>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:munderover>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>i</m:mi>
                                          <m:mo>=</m:mo>
                                          <m:mn>1</m:mn>
                                       </m:mrow>
                                       <m:mi>t</m:mi>
                                    </m:munderover>
                                    <m:mrow>
                                       <m:mo stretchy="false">(</m:mo>
                                       <m:mo>|</m:mo>
                                       <m:msub>
                                          <m:mi>A</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>&#8722;</m:mo>
                                       <m:msup>
                                          <m:mi>A</m:mi>
                                          <m:mo>&#8727;</m:mo>
                                       </m:msup>
                                       <m:mo>|</m:mo>
                                    </m:mrow>
                                 </m:mstyle>
                                 <m:mo>+</m:mo>
                                 <m:mo>|</m:mo>
                                 <m:msub>
                                    <m:mi>C</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msup>
                                    <m:mi>C</m:mi>
                                    <m:mo>&#8727;</m:mo>
                                 </m:msup>
                                 <m:mo>|</m:mo>
                                 <m:mo>+</m:mo>
                                 <m:mo>|</m:mo>
                                 <m:msub>
                                    <m:mi>G</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msup>
                                    <m:mi>G</m:mi>
                                    <m:mo>&#8727;</m:mo>
                                 </m:msup>
                                 <m:mo>|</m:mo>
                                 <m:mo>+</m:mo>
                                 <m:mo>|</m:mo>
                                 <m:msub>
                                    <m:mi>T</m:mi>
                                    <m:mi>i</m:mi>
                                 </m:msub>
                                 <m:mo>&#8722;</m:mo>
                                 <m:msup>
                                    <m:mi>T</m:mi>
                                    <m:mo>&#8727;</m:mo>
                                 </m:msup>
                                 <m:mo>|</m:mo>
                                 <m:mo stretchy="false">)</m:mo>
                              </m:mrow>
                              <m:mrow>
                                 <m:mi>s</m:mi>
                                 <m:mo>.</m:mo>
                                 <m:msup>
                                    <m:mi>t</m:mi>
                                    <m:mn>2</m:mn>
                                 </m:msup>
                              </m:mrow>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGacaGaaiaabeqaaeqabiWaaaGcbaGaemOuaiLaem4qamKaemOvayLaeyypa0tcfa4aaSaaaeaadaaeWbqaaiabcIcaOiabcYha8jabdgeabnaaBaaabaGaemyAaKgabeaacqGHsislcqWGbbqqdaahaaqabeaacqGHxiIkaaGaeiiFaWhabaGaemyAaKMaeyypa0JaeGymaedabaGaemiDaqhacqGHris5aiabgUcaRiabcYha8jabdoeadnaaBaaabaGaemyAaKgabeaacqGHsislcqWGdbWqdaahaaqabeaacqGHxiIkaaGaeiiFaWNaey4kaSIaeiiFaWNaem4raC0aaSbaaeaacqWGPbqAaeqaaiabgkHiTiabdEeahnaaCaaabeqaaiabgEHiQaaacqGG8baFcqGHRaWkcqGG8baFcqWGubavdaWgaaqaaiabdMgaPbqabaGaeyOeI0Iaemivaq1aaWbaaeqabaGaey4fIOcaaiabcYha8jabcMcaPaqaaiabdohaZjabc6caUiabdsha0naaCaaabeqaaiabikdaYaaaaaaaaa@63FE@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>where <it>A</it><sub><it>i</it></sub>, <it>C</it><sub><it>i</it></sub>, <it>G</it><sub><it>i</it></sub>, <it>T</it><sub><it>i </it></sub>denote the numbers of each nucleotide for taxon <it>i</it>, and <it>A*</it>, <it>C*</it>, <it>G* </it>and <it>T* </it>are averages across the <it>t </it>taxa, and <it>s </it>is the total number of sites. RCV therefore quantifies the extent of overall base composition variability among sequences.</p>
            <p>Third, we used maximum likelihood (ML) analyses <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> to describe and compare the nucleotide substitution pattern among exons. The best fitting model for each of the 3,170 candidate markers was chosen by ModelTest version 3.7 <abbrgrp><abbr bid="B38">38</abbr></abbrgrp> based on the Akaike Information Criterion <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. Among the 3,170 best models identified, we report those that occurred the most frequently (Figure <figr fid="F2">2</figr>). Nineteen different substitution models contribute for best fitting 99% of the exon alignments, and they involve either a &#915; distribution (8 times/19), or a fraction of invariable sites (6/19), or a combination thereof (5/19). The best-fitting model that is most often selected by ModelTest is GTR+&#915;, i.e., the General Time Reversible (GTR) model of nucleotide substitution <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> with substitution rate heterogeneity among sites described by a Gamma distribution of shape &#945; <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. For this reason, and for fair comparison of model parameters among exonic alignments, OrthoMaM reports GTR+&#915; estimates for all candidate markers. Specifically, we used PAUP* to provide users with ML estimates of &#945; parameter, base frequencies, and A-C, A-G, A-T, C-G, and C-T GTR substitution rates between nucleotides (relative to G-T = 1.0).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>A variety of models account for the substitution patterns of the 3170 exons</p>
               </caption>
               <text>
                  <p><b>A variety of models account for the substitution patterns of the 3170 exons</b>. Best-fitting models of sequence evolution selected for 99% of the candidate markers are here illustrated. The 22 other best-fitting models that describe the remaining 1% of the exons are here not represented. Blue, orange, and green colours respectively depict models involving Gamma distribution (&#915;), invariable sites (I), and a combination thereof (&#915;+I). Abbreviations: <it>GTR </it>(General Time Reversible), <it>TVM </it>(Transversional model), <it>HKY </it>(Hasegawa, Kishino, Yano 85), <it>TrN </it>(Tamura-Nei 93), <it>TIM </it>(Transitional model), <it>K81 </it>(Kimura 81), and <it>SYM </it>(Symmetrical model); <it>ef </it>(equal base frequencies), and <it>uf </it>(unequal base frequencies).</p>
               </text>
               <graphic file="1471-2148-7-241-2"/>
            </fig>
            <p>Fourth, an important descriptor of the utility of a phylogenetic marker is its relative evolutionary rate: faster (respectively slower) evolving markers will be more suitable for lower (respectively deeper) taxonomic levels. In a first approximation, the TBL of the highest-likelihood tree is a reasonable descriptor of the evolutionary rate of a given exon. However, the TBL will preclude fair comparisons among different exons when the taxon sampling differs: the higher the species number, the longer the TBL. To circumvent this problem, we used the Super Distance Matrix (SDM) approach <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>, with a three-step procedure: (i) The ML tree inferred from each of the 3170 exons was converted into a matrix of additive distances by computing the path-length between each pair of species. (ii) Each of the 3170 matrices was brought closer to the others by a factor (<it>&#945;</it><sub><it>p</it></sub>), according to the least-squares criterion; this operation is equivalent to multiplying by <it>&#945;</it><sub><it>p </it></sub>every branch length of the initial trees. (iii) Optimal values of the <it>&#945;</it><sub><it>p </it></sub>parameters are calculated following SDM* in reference <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>, and, as <it>&#945;</it><sub><it>p </it></sub>are inversely proportional to the evolutionary rates, 1/<it>&#945;</it><sub><it>p </it></sub>values provide a measure of rate heterogeneities among exons even if the number of taxa differs. In addition, the quality of the highest-likelihood tree was also measured through its treeness, i.e., the relative contribution of the sum of internal branches to the TBL <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. If a star tree is inferred from a given marker, this lack of phylogenetic signal will be reflected by a treeness value of zero. Conversely, the higher the treeness, the better the resolution of the internal parts of the tree will be.</p>
            <p>In order to better evaluate the contrast level of evolutionary dynamics among codon positions, some calculations were also conducted separately on first, second, and third codon positions. The distribution of variability over the three codon positions was evaluated for each exonic marker by calculating the contribution of first, second, and third codon positions relative to the total number of variable sites. This allows distinguishing between exons with variability concentrated on third positions versus exons with a more even distribution over the three codon positions. Such a distinction might be useful for maximizing the number of phylogenetically informative characters when selecting an exon for further sequencing in a given taxonomic group. For example, the widely used exon 11 of <it>BRCA1 </it>has been shown to have an almost equal distribution in the number of substitutions among the three codon positions <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>. This property associated with a relatively slow overall substitution rate made <it>BRCA1 </it>a particularly informative marker for resolving placental mammal earliest divergences <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Utility and discussion</p>
         </st>
         <sec>
            <st>
               <p>User interface</p>
            </st>
            <p>To search the OrthoMaM database, a request form is available (Figure <figr fid="F3">3</figr>). A range of values of the evolutionary dynamics descriptors can be given, either independently or in combination. This includes the number of species for which the orthologous exons are available (set to a maximum of 12 for the current version), the relative evolutionary rate of the markers (as measured by the SDM procedure on highest-likelihood trees), their level of among-site substitution rate heterogeneity as measured by the &#945; shape of the &#915; distribution, and their percent of GC at third codon positions. Moreover, this query can be associated with a query about genomic descriptors including gene symbol, human-murine-canine chromosome numbers, and exon length for the three reference species <it>Homo</it>, <it>Mus </it>and <it>Canis</it>. The result of the query is visualized as a recapitulative table of the different descriptors with link to the alignment in Fasta format (Figure <figr fid="F4">4</figr>). Once the marker is chosen by the user, the synopsis of all its descriptors can also be obtained with direct links to EnsEMBL for details on each individual sequence (Figure <figr fid="F5">5</figr>).</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Screenshot of the query form of OrthoMaM</p>
               </caption>
               <text>
                  <p><b>Screenshot of the query form of OrthoMaM</b>. In this example, the user requests orthologous exons available for all 12 mammals, characterized by a relative evolutionary rate ranging from 0.8 to 1.2, and a length ranging from 800 to 1,500 bp for human, mouse, and dog.</p>
               </text>
               <graphic file="1471-2148-7-241-3"/>
            </fig>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Screenshot of the result sheet</p>
               </caption>
               <text>
                  <p><b>Screenshot of the result sheet</b>. The result of the query previously submitted (see Figure 3) is shown. Fourteen candidate exons are recovered with a recapitulation of their phylogenetic descriptors.</p>
               </text>
               <graphic file="1471-2148-7-241-4"/>
            </fig>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Screenshot of the recapitulation of the evolutionary descriptors of <it>CHGUT_HUMAN</it></p>
               </caption>
               <text>
                  <p><b>Screenshot of the recapitulation of the evolutionary descriptors of <it>CHGUT_HUMAN</it></b>. Details about the available exon of each mammal are presented, including EnsEMBL gene/transcript/exon identifiers, chromosome location, and chromosome coordinates. Phylogenetic information is also provided, including base frequencies, GTR matrix of the relative substitution rates between nucleotides, various evolutionary descriptors, and a summary of site variability. The alignment and the corresponding highest-likelihood tree are also given (not shown here).</p>
               </text>
               <graphic file="1471-2148-7-241-5"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Evolutionary descriptors</p>
            </st>
            <p>To illustrate the importance of providing various descriptors of the evolutionary dynamics of the 3,170 exonic markers currently included in the OrthoMaM database, we summarized the range of values for some of them (Table <tblr tid="T1">1</tblr>). For fair comparison among markers with respect to missing taxa, only descriptor values for the 118 exons present in all 12 species are here presented. People interested in getting the maximum number of molecular characters from a single PCR amplification will focus on longer exons. For example, the alignment of the longest conserved exon (EnsEMBL gene ENSG00000189079: AT rich interactive domain 2 gene [<it>ARID2</it>], human exon 15) spans ~2.8 kb among the 12 mammals. On average, exonic alignments contain one-third (35.7%) of variable sites, among which two-third (68.4%) occur at third codon positions. Some exons retain > 90% of their variability at third codon positions (the maximum is 93.4%). Others possess a more evenly distributed variability, with a maximum variability of 34.4% on first codon positions, and 28.6% at second codon positions. Evenly distributed site variability is associated with a high value of the &#945;-shape parameter and potentially reduces the level of homoplasy as nucleotide substitutions could accumulate at each codon position over the whole exon <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. About base composition, the mean GC3 is 58.7%, with a very wide range from 29.3% to 89.3%. Nucleotide substitution patterns are fairly variable as well, with e.g. the relative C-T substitution rate ranging from 1.1 to 34.5, and the average transition/transversion rate ratio ranging from 0.8 to 6.5. Moreover, the relative evolutionary rate among exons, as estimated by SDM, exhibited a more than 10-fold variation between slowest- and fastest-evolving candidates.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Descriptors of the evolutionary dynamics of the 118 exons available for 12 mammals (human, chimp, macaque, mouse, rat, rabbit, cow, dog, elephant, tenrec, armadillo, and opossum).</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="center">
                        <p>
                           <b>DESCRIPTORS</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MEAN</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>SE</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MIN</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>MAX</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Exon length</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>800</p>
                     </c>
                     <c ca="center">
                        <p>465</p>
                     </c>
                     <c ca="center">
                        <p>405</p>
                     </c>
                     <c ca="center">
                        <p>2883</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Variability</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>35.7</p>
                     </c>
                     <c ca="center">
                        <p>10.0</p>
                     </c>
                     <c ca="center">
                        <p>15.7</p>
                     </c>
                     <c ca="center">
                        <p>60.6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>% var 1</b>
                           <sup>st</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>19.5</p>
                     </c>
                     <c ca="center">
                        <p>6.2</p>
                     </c>
                     <c ca="center">
                        <p>5.9</p>
                     </c>
                     <c ca="center">
                        <p>34.4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>% var 2</b>
                           <sup>nd</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>12.1</p>
                     </c>
                     <c ca="center">
                        <p>7.1</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>28.6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>% var 3</b>
                           <sup>rd</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>68.4</p>
                     </c>
                     <c ca="center">
                        <p>12.8</p>
                     </c>
                     <c ca="center">
                        <p>42.0</p>
                     </c>
                     <c ca="center">
                        <p>93.4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>&#945; (&#915; distribution)</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.4</p>
                     </c>
                     <c ca="center">
                        <p>0.2</p>
                     </c>
                     <c ca="center">
                        <p>0.1</p>
                     </c>
                     <c ca="center">
                        <p>1.3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>%GC</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>52.8</p>
                     </c>
                     <c ca="center">
                        <p>7.5</p>
                     </c>
                     <c ca="center">
                        <p>39.5</p>
                     </c>
                     <c ca="center">
                        <p>69.4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>%GC3</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>58.7</p>
                     </c>
                     <c ca="center">
                        <p>15.2</p>
                     </c>
                     <c ca="center">
                        <p>29.3</p>
                     </c>
                     <c ca="center">
                        <p>89.3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>RCV (&#215; 1000)</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>6.8</p>
                     </c>
                     <c ca="center">
                        <p>2.9</p>
                     </c>
                     <c ca="center">
                        <p>1.7</p>
                     </c>
                     <c ca="center">
                        <p>16.3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>A-C</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1.5</p>
                     </c>
                     <c ca="center">
                        <p>0.9</p>
                     </c>
                     <c ca="center">
                        <p>0.4</p>
                     </c>
                     <c ca="center">
                        <p>6.7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>A-G</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>6.1</p>
                     </c>
                     <c ca="center">
                        <p>3.4</p>
                     </c>
                     <c ca="center">
                        <p>1.4</p>
                     </c>
                     <c ca="center">
                        <p>24.4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>A-T</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1.0</p>
                     </c>
                     <c ca="center">
                        <p>0.7</p>
                     </c>
                     <c ca="center">
                        <p>0.2</p>
                     </c>
                     <c ca="center">
                        <p>3.8</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>C-G</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1.1</p>
                     </c>
                     <c ca="center">
                        <p>0.7</p>
                     </c>
                     <c ca="center">
                        <p>0.1</p>
                     </c>
                     <c ca="center">
                        <p>3.6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>C-T</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>7.7</p>
                     </c>
                     <c ca="center">
                        <p>4.8</p>
                     </c>
                     <c ca="center">
                        <p>1.1</p>
                     </c>
                     <c ca="center">
                        <p>34.5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>G-T</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1.0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>1.0</p>
                     </c>
                     <c ca="center">
                        <p>1.0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Ti/Tv</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>2.8</p>
                     </c>
                     <c ca="center">
                        <p>0.9</p>
                     </c>
                     <c ca="center">
                        <p>0.8</p>
                     </c>
                     <c ca="center">
                        <p>6.5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>TBL</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>0.9</p>
                     </c>
                     <c ca="center">
                        <p>0.4</p>
                     </c>
                     <c ca="center">
                        <p>0.2</p>
                     </c>
                     <c ca="center">
                        <p>1.9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <b>Relative rate (SDM)</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1.0</p>
                     </c>
                     <c ca="center">
                        <p>0.4</p>
                     </c>
                     <c ca="center">
                        <p>0.2</p>
                     </c>
                     <c ca="center">
                        <p>2.6</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Mean, standard-error (SE), minimum (MIN) and maximum (MAX) values are given for exon length, variability (% of variable sites on the complete alignment), relative variability (% var) among codon positions (% of the variable sites that respectively occurred on first [1st], second [2nd], and third [3rd] codon positions), substitution rate heterogeneity among sites (&#945;), Guanosine + Cytosine content on all (GC) and third (GC3) codon positions, relative composition variability (RCV), relative GTR substitution rates (A-C to G-T), average transition/transversion rate ratio (Ti/Tv), total branch length (TBL) of the tree, and relative evolutionary rate (as measured by the SDM procedure).</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Development of new phylogenetic markers</p>
            </st>
            <p>We illustrate the potential utility of the OrthoMaM database with the development of two new markers for placental phylogenetics. We focused on the 118 candidates retrieved for all 12 mammals, and we chose among exons with a length ranging between 800&#8211;1500 bp for the three-species core (human, mouse and dog), with an intermediate relative rate of evolution, i.e., SDM value ranging from to 0.8 to 1.2 (Figure <figr fid="F3">3</figr>). Fourteen candidate exons satisfied the combination of these three criteria, among which <it>CHGUT_HUMAN </it>(1,308 bp) and <it>NCOA1 </it>(1,347 bp) were the longest (Figure <figr fid="F4">4</figr>). We thus selected the corresponding alignments containing sequences that are orthologous to exon 4 of the human CHondroitin sulfate GlucUronylTransferase gene (EnsEMBL gene reference ENSG00000033100), and to exon 11 of the human Nuclear receptor CO-Activator 1 gene (ENSG00000084676).</p>
            <p>Our <it>in silico </it>approach for development of new markers was then validated by the successful amplification and sequencing of <it>CHGUT_HUMAN </it>exon 4 orthologues for species belonging to two of the most evolutionary distant groups of placental mammals: xenarthrans (member of Atlantogenata, the clade of southern origin), and rodents (member of Boreoeutheria, the clade of northern origin) <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. The xenarthran species was the anteater <it>Tamandua tetradactyla</it>. The rodent species were a caviomorph (the degu, <it>Octodon degu</it>), a sciurid (the Guianan squirrel, <it>Sciurus aestuans</it>), a dipodid (the lesser Egyptian jerboa, <it>Jaculus jaculus</it>), and two sigmodontine muroids (the MacConnell's rice rat, <it>Oryzomys macconnelli</it>, and the Guiana bristly mouse, <it>Neacomys guianae</it>). Similarly, we obtained sequences of <it>NCOA1 </it>exon 11 orthologues for rodents, including <it>Octodon</it>, <it>Jaculus</it>, <it>Oryzomys</it>, <it>Neacomys</it>, and also a glirid (the garden dormouse, <it>Eliomys quercinus</it>) and an anomalurid (a scaly-tailed flying squirrel, <it>Anomalurus </it>sp.).</p>
            <p>All ethanol preserved tissues were extracted using QIAamp DNAminikit following manufacturer (<it>QIAGEN</it>) instructions. A 1.1 kb portion of <it>CHGUT_HUMAN </it>exon 4 orthologues was amplified by Polymerase Chain Reaction (PCR) with forward 1F (5'-GCYCAGATCCGGAACCTGAC-3') and reverse 1R (5'-AACCGGAGGAAAACATCCATCACC-3') primers. A 1.2 kb portion of <it>NCOA1 </it>exon 11 orthologues was amplified with forward 1F (5'-CAGTGGCCTTTCTCCTCAAG-3') and reverse 1R (5'-ACCTTTACRTCATCCAGGC-3') primers. Amplification reactions were carried out in 20 &#956;l including 50 &#956;M of each primer, dNTP (200 &#956;M), 1&#215; Taq buffer, 1 U Taq polymerase (<it>Triple Master PCR System Eppendorf</it>) and 50&#8211;100 ng genomic DNA. Amplifications were performed in Mastercycler gradient (<it>Eppendorf</it>) using denaturation at 94&#176;C (4 min), followed by 29 temperature cycles of 94&#176;C (20 sec), 53&#176;C to 60&#176;C (20 sec) and 72&#176;C (1 min 30 sec), with a final extension at 72&#176;C (10 min). The temperature gradient was necessary for PCR optimisation on all taxa. PCR products were purified from 1% agarose gels with the DNA gel extraction kit (<it>Millipore</it>) and directly sequenced using the 1F/1R external primers, and two internal ones: 3F (5'-GTGGARATCCTGCCYATGCC-3') and 2R (5-CACCTGGGAMGGKGCCTC-3') for <it>CHGUT_HUMAN</it>, and 2F (5'-CAAACAATTCATTTCCTCC-3') and 2R (5'-GCATGCCGTAACTGCTG-3') for <it>NCOA1</it>. The Bigdye Terminator kit v1.1 (<it>Applied Biosystem</it>) was used and sequencing reactions were run on an ABI 310 (<it>Applied Biosystem</it>) automated sequencer. Sequences have been deposited in the EMBL database under accession numbers <ext-link ext-link-type="embl" ext-link-id="AM900835">AM900835</ext-link> to <ext-link ext-link-type="embl" ext-link-id="AM900846">AM900846</ext-link>.</p>
            <p>Newly obtained <it>CHGUT_HUMAN </it>and <it>NCOA1 </it>sequences were added to the ones of the 12 mammals included in the OrthoMaM database, complemented by other species downloaded from ongoing genome projects and traces (<it>Equus caballus</it>, <it>Myotis lucifugus</it>, <it>Felis catus</it>, <it>Tupaia belangeri</it>, <it>Otolemur garnettii</it>, <it>Cavia porcellus</it>, <it>Spermophilus tridecemlineatus</it>, and <it>Dipodomys ordii</it>). Maximum likelihood analyses of the final alignments under the best-fitting GTR+&#915; model yielded trees conforming to the current view of the placental phylogeny (Figure <figr fid="F6">6</figr>). The four major clades Afrotheria, Xenarthra, Laurasiatheria, and Euarchontoglires are recovered. Among the latter group, the rabbit (Lagomorpha) is the sister group of rodents, forming the Glires clade. Monophyletic rodents subdivide into three subclades respectively containing Guinea-pig related, squirrel-related, and mouse-related species, in agreement with recent multigene phylogenies of Rodentia <abbrgrp><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp>.</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Maximum likelihood phylogenies reconstructed from alignments of orthologues to the human CHondroitin sulfate GlucUronylTransferase (<it>CHGUT_HUMAN</it>) exon 4 [left], and to the human Nuclear receptor CO-Activator 1 gene (<it>NCOA1</it>) exon 11 [right]</p>
               </caption>
               <text>
                  <p><b>Maximum likelihood phylogenies reconstructed from alignments of orthologues to the human CHondroitin sulfate GlucUronylTransferase (<it>CHGUT_HUMAN</it>) exon 4 [left], and to the human Nuclear receptor CO-Activator 1 gene (<it>NCOA1</it>) exon 11 [right]</b>. Values on nodes are ML bootstrap support. Taxa for which the <it>CHGUT_HUMAN </it>and <it>NCOA1 </it>exons were obtained in vivo are indicated in bold + asterisk. Sequences from other taxa were recovered <it>in silico </it>from traces, pre-assembled, and assembled EnsEMBL genomes. The name of major clades is provided on the right, and blue rectangles correspond to Euarchontoglires mammals. The outgroup <it>Monodelphis </it>is drawn with midpoint rooting. Horizontal branch lengths are proportional to the DNA divergence (same scale for both exons = 0.1 nucleotide substitutions per site). Maximum likelihood details about <it>CHGUT_HUMAN</it>|<it>NCOA1 </it>phylograms are respectively as follows: log-likelihoods are ln<it>L </it>= -8,868.8|8,899.4, and estimates of model parameters are: %A = 15.6|29.2, %C = 32.6|27.5, %G = 33.0|21.2, and %T = 18.8|22.1 for base frequencies ; A-C = 1.11|0.63, A-G = 4.61|4.04, A-T = 1.38|0.68, C-G = 0.44|0.54, C-T = 4.17|3.31, and G-T = 1.00 for GTR relative substitution rates ; and &#945; = 0.30|0.51 for rate heterogeneity among sites.</p>
               </text>
               <graphic file="1471-2148-7-241-6"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>A phylogenomic approach on placental mammals</p>
            </st>
            <p>As a second illustration of the utility of the OrthoMaM database, we used it in a phylogenomic perspective. To that aim, we combined the 118 exons present for all 12 species (human, chimp, macaque, mouse, rat, rabbit, cow, dog, elephant, tenrec, armadillo, and opossum), and obtained a supermatrix of 94,739 sites. We only used exons detected in these 12 mammals by our automatic, bioinformatic pipeline, to minimize the amount of missing character states which is as low as 4.8% due to incomplete 5'/3' coverage and gapped sites. The highest-likelihood topology calculated by PAUP* was well-resolved, as evaluated through ML bootstrap support, except for the position of the placental root (Figure <figr fid="F7">7</figr>). Rodents (<it>Mus </it>and <it>Rattus</it>) and the lagomorph (<it>Oryctolagus</it>) are grouped into Glires as a sister-group of catarrhine primates (<it>Homo</it>, <it>Pan </it>and <it>Macaca</it>) to form Euarchontoglires. Euarchontoglires branch with Laurasiatheria (here represented by <it>Bos </it>and <it>Canis</it>) into Boreoeutheria, all with 100% bootstrap support.</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Maximum likelihood phylogeny reconstructed from a 94-kb alignment of 118 concatenated orthologous exons present for all 12 mammals</p>
               </caption>
               <text>
                  <p><b>Maximum likelihood phylogeny reconstructed from a 94-kb alignment of 118 concatenated orthologous exons present for all 12 mammals</b>. The log-likelihood of the phylogram is ln<it>L </it>= -407,623.4, and estimates of model parameters are: %A = 25.9, %C = 27.0, %G = 25.0, and %T = 22.1 for base frequencies ; A-C = 1.29, A-G = 4.69, A-T = 0.72, C-G = 1.21, C-T = 5.75, and G-T = 1.00 for GTR relative substitution rates ; INV = 31.7% for the fraction of invariable sites, and &#945; = 0.74 for rate heterogeneity among the remaining sites. Values on nodes are ML bootstrap support. Sequences from the 12 taxa were <it>in silico </it>recovered from assembled EnsEMBL genomes. The outgroup <it>Monodelphis </it>is drawn with midpoint rooting. Horizontal branch lengths are proportional to the DNA divergence (scale = 0.05 nucleotide substitutions per site).</p>
               </text>
               <graphic file="1471-2148-7-241-7"/>
            </fig>
            <p>Our topology is thus fully compatible with the new understanding of placental mammal phylogeny revealed by early multigene analyses to the exception of the position of the root <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B16">16</abbr><abbr bid="B49">49</abbr></abbrgrp>. However, other recent studies based on phylogenomic data sets claimed support for a closer evolutionary relationship between primates and carnivores relative to muroid rodents <abbrgrp><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr></abbrgrp>. Such results appear all the more surprising given that the monophyly of rodents plus primates relative to carnivores and cetartiodactyls observed in Figure <figr fid="F7">7</figr> is also supported by a large body of independent evidences including multigene mitochondrial and nuclear DNA phylogenies <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B44">44</abbr><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr><abbr bid="B55">55</abbr></abbrgrp>, indel protein signatures <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>, and SINEs insertions <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>.</p>
            <p>The main commonality among these three recent phylogenomic studies is their use of reduced taxon samplings associated with whole genome sequences, a situation where statistical phylogenetic inconsistency is particularly prone to occur <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B58">58</abbr></abbrgrp>. Interestingly, in our dataset, the evolutionary rate of muroid rodents appears 2.7 times faster than the one of primates as attested by relative branch lengths of the ML phylogram (Figure <figr fid="F7">7</figr>). Nevertheless, <it>Mus</it>, <it>Rattus </it>and <it>Oryctolagus </it>grouped with <it>Macaca</it>, <it>Homo </it>and <it>Pan</it>, whereas a long-branch attraction (LBA) phenomenon <abbrgrp><abbr bid="B59">59</abbr></abbrgrp> would have attracted muroids towards the distant marsupial outgroup.</p>
            <p>In order to test for this LBA hypothesis, we restricted our analysis to the 8-taxon sampling of Cannarozzi et al. <abbrgrp><abbr bid="B50">50</abbr></abbrgrp> including only human, chimp, macaque, mouse, rat, cow, dog, and opossum. The use of a sub-optimal and under-parameterized model, i.e. GTR without &#915;+INV, led to a ML topology conforming to the results obtained by Cannarozzi et al. <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>: <it>Mus </it>+ <it>Rattus </it>branched to the most basal position among placentals, with a 99% bootstrap support for grouping <it>Bos </it>+ <it>Canis </it>with primates. However, under the better-fitting GTR+&#915;+INV model, the ML analysis recovered with 100% bootstrap support the initial topology (<it>cf</it>. Figure <figr fid="F7">7</figr>) in which primates and rodents are grouped together to the exclusion of carnivores + cetartiodactyls. Here, the more sophisticated model seems to correct the long-branch attraction artefact of muroid rodents towards the opossum branch.</p>
            <p>Further investigations were conducted by adding the rabbit in order to break the long isolated muroid branch. Under the GTR model (without &#915;+INV), Glires appeared monophyletic and branched with primates to the exclusion of <it>Canis </it>and <it>Bos</it>, and the ML bootstrap support for the euarchontoglires clade was 77%. A denser taxon sampling (i.e., the addition of <it>Oryctolagus</it>) therefore reduces the LBA phenomenon, and here compensates for model underparameterization. The use of the better-fitting GTR+&#915;+INV model confirmed this trend, and provided 100% bootstrap support for grouping rodents with primates. Finally, the reanalysis of the 12-taxon OrthoMaM supermatrix of 118 exons under a GTR model but without &#915;+INV yielded 100% bootstrap within boreoeutherians for the topology of Figure <figr fid="F7">7</figr>. These analyses confirm the crucial impact of taxon sampling for accurate phylogenetic inference, especially when long isolated branches are involved <abbrgrp><abbr bid="B58">58</abbr><abbr bid="B60">60</abbr></abbrgrp>. Moreover, accounting for among-sites rate variation through a Gamma distribution is important to accurately discriminate among alternative topologies, especially when the taxon sampling is depauperate <abbrgrp><abbr bid="B61">61</abbr><abbr bid="B62">62</abbr></abbrgrp>.</p>
            <p>From our ML analyses, it seems that the basal position of rodents observed in the three recent phylogenomic studies <abbrgrp><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr></abbrgrp> is likely a LBA artefact associated with the use of a reduced taxon sampling and/or inadequate phylogenetic reconstruction methods. Our study strongly supports the phylogenetic affinities of rodents with primates to the exclusion of carnivores (Figure <figr fid="F7">7</figr>), and adds credit to the view that they should no longer be considered as contentious <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. The same phylogenetic relationships among rodents, primates and carnivores is obtained from a GTR+&#915; ML analysis of the supermatrix of characters resulting from the combination of all 3,170 exons of the OrthoMaM database (3,047,860 sites for 12 taxa ; 32% missing character states ; results not shown).</p>
            <p>The only unresolved node in our phylogenomic analysis (Figure <figr fid="F7">7</figr>) involves the position of the root of the placental tree. There has been debate to know whether xenarthrans are sister-group of all remaining placentals (the Epitheria hypothesis) <abbrgrp><abbr bid="B57">57</abbr><abbr bid="B64">64</abbr></abbrgrp>, or branch with Afrotheria (the Atlantogenata hypothesis; <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>), or with Boreoeutheria (the basal Afrotheria hypothesis <abbrgrp><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr></abbrgrp>). Here, the combination of the 118 exons yielded bootstrap support of 45% for Atlantogenata (the highest-likelihood branching pattern), 46% for Epitheria, and 9% for basal Afrotheria. Indel signatures <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> and larger datasets <abbrgrp><abbr bid="B66">66</abbr><abbr bid="B67">67</abbr></abbrgrp> actually seem to favour the Afrotheria + Xenarthra branching. However, it has been argued that the latter two results might reflect an artefact of using concatenated likelihood models whereas partitioned models rather favour the basal Afrotheria hypothesis <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The OrthoMaM database provides an intuitive interface for querying thousand of orthologous exons of potential use in placental mammal systematics. It also allows the easy retrieval of large sets of conserved orthologous exons among the available mammalian genomes to perform phylogenomic analyses. The evolutionary descriptors characterizing each candidate marker are of particular interest for phylogenetic marker choice and for comparative analyses of molecular evolution at the genome scale. The bioinformatic pipeline behind OrthoMaM allows envisioning that the database will be dynamically updated on a regular basis to follow the evolution of EnsEMBL and thereby ensure its accuracy. We expect the OrthoMaM database to prove useful for further resolving the phylogenetic tree of mammals and for understanding the selective pressures that shaped the evolution of their genomes.</p>
      </sec>
      <sec>
         <st>
            <p>Availability and requirements</p>
         </st>
         <p>Project name: OrthoMaM (<it>Ortho</it>logous <it>Ma</it>mmalian <it>M</it>arkers);</p>
         <p>Project home page: <url>http://kimura.univ-montp2.fr/orthomam</url>;</p>
         <p>Operating system(s): Platform independent;</p>
         <p>Programming language: Java, PHP, XML, XSLT;</p>
         <p>License: None;</p>
         <p>Any restrictions to use by non-academics: None.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>VR and EJPD initiated the study, conceived and implemented the bioinformatic pipeline. VR, SR and KB constructed the database and designed the web interface. MKT carried out PCR amplifications and sequencing. EJPD, FD and MKT performed the phylogenetic analyses for the two case studies. VR, FD and EJPD wrote the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Nicolas Galtier, and two anonymous reviewers for helpful comments on the manuscript, and Fran&#231;ois Catzeflis for providing tissue samples. RCV calculation took advantage of a Perl script written by Olivier Jeffroy. Master students Michael Abrouk, Jacques-Mathieu Dainat, Nelly Rozas, Lucile Soler and Claire Poiron are also thanked for their contribution. This work has been supported by the Research Networks Program in BIOINFORMATICS of the High Council for Scientific and Technological Cooperation between France and Isra&#235;l. This publication is the contribution N&#176;2007-098 of the Institut des Sciences de l'Evolution (UMR 5554 &#8211; Universit&#233; Montpellier 2/CNRS).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Evolution of the cytochrome b gene of mammals</p>
            </title>
            <aug>
               <au>
                  <snm>Irwin</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Kocher</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>AC</fnm>
               </au>
            </aug>
            <source>Journal of Molecular Evolution</source>
            <pubdate>1991</pubdate>
            <volume>32</volume>
            <fpage>128</fpage>
            <lpage>144</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF02515385</pubid>
                  <pubid idtype="pmpid">1901092</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Secondary structure and patterns of evolution among mammalian mitochondrial 12S rRNA molecules</p>
            </title>
            <aug>
               <au>
                  <snm>Springer</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Douzery</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Journal of Molecular Evolution</source>
            <pubdate>1996</pubdate>
            <volume>43</volume>
            <fpage>357</fpage>
            <lpage>373</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8798341</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>The guinea-pig is not a rodent</p>
            </title>
            <aug>
               <au>
                  <snm>D'Erchia</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Gissi</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Pesole</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Saccone</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Arnason</snm>
                  <fnm>U</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1996</pubdate>
            <volume>381</volume>
            <fpage>597</fpage>
            <lpage>600</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/381597a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">8637593</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>A molecular perspective on mammalian evolution from the gene encoding interphotoreceptor retinoid binding protein, with convincing evidence for bat monophyly</p>
            </title>
            <aug>
               <au>
                  <snm>Stanhope</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Czelusniak</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Si</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Nickerson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Goodman</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Molecular Phylogenetics and Evolution</source>
            <pubdate>1992</pubdate>
            <volume>1</volume>
            <fpage>148</fpage>
            <lpage>160</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/1055-7903(92)90026-D</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Evidence on mammalian phylogeny from sequences of exon 28 of the von Willebrand factor gene</p>
            </title>
            <aug>
               <au>
                  <snm>Porter</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Goodman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stanhope</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Molecular Phylogenetics and Evolution</source>
            <pubdate>1996</pubdate>
            <volume>5</volume>
            <fpage>89</fpage>
            <lpage>101</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1006/mpev.1996.0008</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Parallel adaptive radiations in two major clades of placental mammals</p>
            </title>
            <aug>
               <au>
                  <snm>Madsen</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Scally</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Douady</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Kao</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>DeBry</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Adkins</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Amrine</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Stanhope</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>de Jong</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Springer</snm>
                  <fnm>MS</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>409</volume>
            <fpage>610</fpage>
            <lpage>614</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35054544</pubid>
                  <pubid idtype="pmpid" link="fulltext">11214318</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Evaluating placental inter-ordinal phylogenies with novel sequences including RAG1, gamma-fibrinogen, ND6, and mt-tRNA, plus MCMC-driven nucleotide, amino acid, and codon models</p>
            </title>
            <aug>
               <au>
                  <snm>Waddell</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Shelley</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Molecular Phylogenetics and Evolution</source>
            <pubdate>2003</pubdate>
            <volume>28</volume>
            <fpage>197</fpage>
            <lpage>224</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/S1055-7903(03)00115-5</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Endemic African mammals shake the phylogenetic tree</p>
            </title>
            <aug>
               <au>
                  <snm>Springer</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Cleven</snm>
                  <fnm>GC</fnm>
               </au>
               <au>
                  <snm>Madsen</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>de Jong</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Waddell</snm>
                  <fnm>VG</fnm>
               </au>
               <au>
                  <snm>Amrine</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Stanhope</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1997</pubdate>
            <volume>388</volume>
            <fpage>61</fpage>
            <lpage>64</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/40386</pubid>
                  <pubid idtype="pmpid" link="fulltext">9214502</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Classification and molecular phylogeny</p>
            </title>
            <aug>
               <au>
                  <snm>Montgelard</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Douzery</snm>
                  <fnm>EJP</fnm>
               </au>
               <au>
                  <snm>Michaux</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Reproductive biology and phylogeny of Cetacea (whales, dolphins and porpoises)</source>
            <publisher>Enfield (New Hampshire, USA) , Science Publishers</publisher>
            <editor>Jamieson BGM</editor>
            <pubdate>2007</pubdate>
            <volume>7</volume>
            <fpage>95</fpage>
            <lpage>125</lpage>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Molecular phylogenetic evidence confirming the Eulipotyphla concept and in support of hedgehogs as the sister group to shrews</p>
            </title>
            <aug>
               <au>
                  <snm>Douady</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Chatelier</snm>
                  <fnm>PI</fnm>
               </au>
               <au>
                  <snm>Madsen</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>de Jong</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Catzeflis</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Springer</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Stanhope</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Molecular Phylogenetics and Evolution</source>
            <pubdate>2002</pubdate>
            <volume>25</volume>
            <fpage>200</fpage>
            <lpage>209</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/S1055-7903(02)00232-4</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Profiling phylogenetic informativeness</p>
            </title>
            <aug>
               <au>
                  <snm>Townsend</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Systematic Biology</source>
            <pubdate>2007</pubdate>
            <volume>56</volume>
            <fpage>222</fpage>
            <lpage>231</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10635150701311362</pubid>
                  <pubid idtype="pmpid" link="fulltext">17464879</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Phylogenetic information and experimental design in molecular systematics</p>
            </title>
            <aug>
               <au>
                  <snm>Goldman</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Proc Biol Sci</source>
            <pubdate>1998</pubdate>
            <volume>265</volume>
            <fpage>1779</fpage>
            <lpage>1786</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1689363</pubid>
                  <pubid idtype="pmpid" link="fulltext">9787470</pubid>
                  <pubid idtype="doi">10.1098/rspb.1998.0502</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>On the best evolutionary rate for phylogenetic analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Systematic Biology</source>
            <pubdate>1998</pubdate>
            <volume>47</volume>
            <fpage>125</fpage>
            <lpage>133</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/106351598261067</pubid>
                  <pubid idtype="pmpid">12064232</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Evaluating the phylogenetic utility of genes - a search for genes informative about deep divergences among vertebrates</p>
            </title>
            <aug>
               <au>
                  <snm>Graybeal</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Systematic Biology</source>
            <pubdate>1994</pubdate>
            <volume>43</volume>
            <fpage>174</fpage>
            <lpage>193</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/2413460</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Comparison of molecular and paleontological data in diatoms suggests a major gap in the fossil record</p>
            </title>
            <aug>
               <au>
                  <snm>Philippe</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Sorhannus</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Baroin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Perasso</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gasse</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Adoutte</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Evol Biol</source>
            <pubdate>1994</pubdate>
            <volume>7</volume>
            <fpage>247</fpage>
            <lpage>265</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1046/j.1420-9101.1994.7020247.x</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Molecular phylogenetics and the origins of placental mammals</p>
            </title>
            <aug>
               <au>
                  <snm>Murphy</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Eizirik</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>WE</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>YP</fnm>
               </au>
               <au>
                  <snm>Ryder</snm>
                  <fnm>OA</fnm>
               </au>
               <au>
                  <snm>O'Brien</snm>
                  <fnm>SJ</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>409</volume>
            <fpage>614</fpage>
            <lpage>618</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35054550</pubid>
                  <pubid idtype="pmpid" link="fulltext">11214319</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Mammalian phylogenomics comes of age</p>
            </title>
            <aug>
               <au>
                  <snm>Murphy</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Pevzner</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>O'Brien</snm>
                  <fnm>SJ</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>631</fpage>
            <lpage>639</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2004.09.005</pubid>
                  <pubid idtype="pmpid" link="fulltext">15522459</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>The ENCODE Project at UC Santa Cruz</p>
            </title>
            <aug>
               <au>
                  <snm>Thomas</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Rosenbloom</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Clawson</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hinrichs</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Trumbower</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Raney</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Karolchik</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Barber</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Harte</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Hillman-Jackson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kuhn</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Rhead</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Thakkapallayil</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Zweig</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kent</snm>
                  <fnm>WJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>D663</fpage>
            <lpage>D667</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1781110</pubid>
                  <pubid idtype="pmpid" link="fulltext">17166863</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl1017</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>The ENCODE (ENCyclopedia Of DNA Elements) Project</p>
            </title>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>306</volume>
            <fpage>636</fpage>
            <lpage>640</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1105136</pubid>
                  <pubid idtype="pmpid" link="fulltext">15499007</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <aug>
               <au>
                  <snm>Program</snm>
                  <fnm>NISCCVS</fnm>
               </au>
            </aug>
            <url>http://www.nisc.nih.gov/</url>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Automatic clustering of orthologs and in-paralogs from pairwise species comparisons</p>
            </title>
            <aug>
               <au>
                  <snm>Remm</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Storm</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2001</pubdate>
            <volume>314</volume>
            <fpage>1041</fpage>
            <lpage>1052</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2000.5197</pubid>
                  <pubid idtype="pmpid" link="fulltext">11743721</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>OrthoMCL: Identification of ortholog groups for eukaryotic genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Stoeckert</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Roos</snm>
                  <fnm>DS</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>2178</fpage>
            <lpage>2189</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403725</pubid>
                  <pubid idtype="pmpid" link="fulltext">12952885</pubid>
                  <pubid idtype="doi">10.1101/gr.1224503</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Mackey</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Stoeckert</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Roos</snm>
                  <fnm>DS</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <fpage>D363</fpage>
            <lpage>D368</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347485</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381887</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj123</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Inparanoid: a comprehensive database of eukaryotic orthologs</p>
            </title>
            <aug>
               <au>
                  <snm>O'Brien</snm>
                  <fnm>KP</fnm>
               </au>
               <au>
                  <snm>Remm</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>D476</fpage>
            <lpage>480</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540061</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608241</pubid>
                  <pubid idtype="doi">10.1093/nar/gki107</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Assessing performance of orthology detection strategies applied to eukaryotic genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Mackey</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Vermunt</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Roos</snm>
                  <fnm>DS</fnm>
               </au>
            </aug>
            <source>PLoS ONE</source>
            <pubdate>2007</pubdate>
            <volume>2</volume>
            <fpage>e383</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1849888</pubid>
                  <pubid idtype="pmpid" link="fulltext">17440619</pubid>
                  <pubid idtype="doi">10.1371/journal.pone.0000383</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Ensembl 2007</p>
            </title>
            <aug>
               <au>
                  <snm>Hubbard</snm>
                  <fnm>TJP</fnm>
               </au>
               <au>
                  <snm>Aken</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Beal</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ballester</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Caccamo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Clarke</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Coates</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Cunningham</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Cutts</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Down</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Dyer</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Fitzgerald</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Fernandez-Banet</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Graf</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Haider</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hammond</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Herrero</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Holland</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Howe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Howe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Kahari</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Keefe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kokocinski</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Kulesha</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Lawson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Longden</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Melsopp</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Megy</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Meidl</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Overduin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Parker</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Prlic</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rice</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rios</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Schuster</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sealy</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Severin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Slater</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Smedley</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Spudich</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Trevanion</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Vilella</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Vogel</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wood</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Curwen</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Fernandez-Suarez</snm>
                  <fnm>XM</fnm>
               </au>
               <au>
                  <snm>Flicek</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kasprzyk</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Proctor</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Searle</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ureta-Vidal</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>D610</fpage>
            <lpage>D617</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1761443</pubid>
                  <pubid idtype="pmpid" link="fulltext">17148474</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl996</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Combining bioinformatics and phylogenetics to identify large sets of single-copy orthologous genes (COSII) for comparative, evolutionary and systematic studies: A test case in the euasterid plant clade</p>
            </title>
            <aug>
               <au>
                  <snm>Wu</snm>
                  <fnm>FN</fnm>
               </au>
               <au>
                  <snm>Mueller</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Crouzillat</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Petiard</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Tanksley</snm>
                  <fnm>SD</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2006</pubdate>
            <volume>174</volume>
            <fpage>1407</fpage>
            <lpage>1420</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1667096</pubid>
                  <pubid idtype="pmpid" link="fulltext">16951058</pubid>
                  <pubid idtype="doi">10.1534/genetics.106.062455</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>A practical approach to phylogenomics: the phylogeny of ray-finned fish (Actinopterygii) as a case study</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Orti</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>GQ</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2007</pubdate>
            <volume>7</volume>
            <fpage>44</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1838417</pubid>
                  <pubid idtype="pmpid" link="fulltext">17374158</pubid>
                  <pubid idtype="doi">10.1186/1471-2148-7-44</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Phylogenomics and the reconstruction of the tree of life</p>
            </title>
            <aug>
               <au>
                  <snm>Delsuc</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Brinkmann</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Philippe</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>361</fpage>
            <lpage>375</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg1603</pubid>
                  <pubid idtype="pmpid" link="fulltext">15861208</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Choosing the best genes for the job: the case for stationary genes in genome-scale phylogenetics</p>
            </title>
            <aug>
               <au>
                  <snm>Collins</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Fedrigo</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Naylor</snm>
                  <fnm>GJP</fnm>
               </au>
            </aug>
            <source>Systematic Biology</source>
            <pubdate>2005</pubdate>
            <volume>54</volume>
            <fpage>493</fpage>
            <lpage>500</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10635150590947339</pubid>
                  <pubid idtype="pmpid" link="fulltext">16012114</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>The root of the mammalian tree inferred from whole mitochondrial genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Phillips</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Penny</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Molecular Phylogenetics and Evolution</source>
            <pubdate>2003</pubdate>
            <volume>28</volume>
            <fpage>171</fpage>
            <lpage>185</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/S1055-7903(03)00057-5</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Comparison of models for nucleotide substitution used in maximum-likelihood phylogenetic estimation</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Goldman</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Friday</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1994</pubdate>
            <volume>11</volume>
            <fpage>316</fpage>
            <lpage>324</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8170371</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>The Ensembl automatic gene annotation system</p>
            </title>
            <aug>
               <au>
                  <snm>Curwen</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Eyras</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Andrews</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Clarke</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Mongin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Searle</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Clamp</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>942</fpage>
            <lpage>950</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">479124</pubid>
                  <pubid idtype="pmpid" link="fulltext">15123590</pubid>
                  <pubid idtype="doi">10.1101/gr.1858004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>transAlign: using amino acids to facilitate the multiple alignment of protein-coding DNA sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Bininda-Emonds</snm>
                  <fnm>OR</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>156</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1175081</pubid>
                  <pubid idtype="pmpid" link="fulltext">15969769</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-6-156</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>The neighbor-joining method: a new method for reconstructing phylogenetic trees</p>
            </title>
            <aug>
               <au>
                  <snm>Saitou</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1987</pubdate>
            <volume>4</volume>
            <fpage>406</fpage>
            <lpage>425</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">3447015</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>PAUP*. Phylogenetic Analysis Using Parsimony (* and Other Methods). Version 4.0b10.</p>
            </title>
            <aug>
               <au>
                  <snm>Swofford</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <publisher>Sunderland, Massachusetts , Sinauer Associates</publisher>
            <pubdate>2002 </pubdate>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12504223</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Evolutionary trees from DNA sequences: a maximum likelihood approach</p>
            </title>
            <aug>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Journal of Molecular Evolution</source>
            <pubdate>1981</pubdate>
            <volume>17</volume>
            <fpage>368</fpage>
            <lpage>376</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF01734359</pubid>
                  <pubid idtype="pmpid">7288891</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>MODELTEST: testing the model of DNA substitution</p>
            </title>
            <aug>
               <au>
                  <snm>Posada</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Crandall</snm>
                  <fnm>KA</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <fpage>817</fpage>
            <lpage>818</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/14.9.817</pubid>
                  <pubid idtype="pmpid" link="fulltext">9918953</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>A new look at the statistical model identification</p>
            </title>
            <aug>
               <au>
                  <snm>Akaike</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>IEEE Trans Autom Contr</source>
            <pubdate>1974</pubdate>
            <volume>AC-19</volume>
            <fpage>716&#8211;723</fpage>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Estimating the pattern of nucleotide substitution</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Journal of Molecular Evolution</source>
            <pubdate>1994</pubdate>
            <volume>39</volume>
            <fpage>105</fpage>
            <lpage>111</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8064867</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Among-site rate variation and its impact on phylogenetic analyses</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Trends in Ecology and Evolution</source>
            <pubdate>1996</pubdate>
            <volume>11</volume>
            <fpage>367</fpage>
            <lpage>372</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/0169-5347(96)10041-0</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>SDM: A fast distance-based approach for (super) tree building in phylogenomics</p>
            </title>
            <aug>
               <au>
                  <snm>Criscuolo</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Berry</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Douzery</snm>
                  <fnm>EJP</fnm>
               </au>
               <au>
                  <snm>Gascuel</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Systematic Biology</source>
            <pubdate>2006</pubdate>
            <volume>55</volume>
            <fpage>740</fpage>
            <lpage>755</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10635150600969872</pubid>
                  <pubid idtype="pmpid" link="fulltext">17060196</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Molecular phylogeny and divergence time estimates for major rodent groups: evidence from multiple genes</p>
            </title>
            <aug>
               <au>
                  <snm>Adkins</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Gelke</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Rowe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Honeycutt</snm>
                  <fnm>RL</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2001</pubdate>
            <volume>18</volume>
            <fpage>777</fpage>
            <lpage>791</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11319262</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Molecular phylogeny of living xenarthrans and the impact of character and taxon sampling on the placental tree rooting</p>
            </title>
            <aug>
               <au>
                  <snm>Delsuc</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Scally</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Madsen</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Stanhope</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>de Jong</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Catzeflis</snm>
                  <fnm>FM</fnm>
               </au>
               <au>
                  <snm>Springer</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Douzery</snm>
                  <fnm>EJP</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <fpage>1656</fpage>
            <lpage>1671</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12270893</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Molecular evidence for the major clades of placental mammals</p>
            </title>
            <aug>
               <au>
                  <snm>Scally</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Madsen</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Douady</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>de Jong</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Stanhope</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Springer</snm>
                  <fnm>MS</fnm>
               </au>
            </aug>
            <source>J Mammal Evol</source>
            <pubdate>2001</pubdate>
            <fpage>239</fpage>
            <lpage>277</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1023/A:1014446915393</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Using genomic data to unravel the root of the placental mammal phylogeny</p>
            </title>
            <aug>
               <au>
                  <snm>Murphy</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Pringle</snm>
                  <fnm>TH</fnm>
               </au>
               <au>
                  <snm>Crider</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Springer</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2007</pubdate>
            <volume>17</volume>
            <fpage>413</fpage>
            <lpage>421</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1832088</pubid>
                  <pubid idtype="pmpid" link="fulltext">17322288</pubid>
                  <pubid idtype="doi">10.1101/gr.5918807</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Multiple molecular evidences for a living mammalian fossil</p>
            </title>
            <aug>
               <au>
                  <snm>Huchon</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chevret</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Jordan</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Kilpatrick</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Ranwez</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Jenkins</snm>
                  <fnm>PD</fnm>
               </au>
               <au>
                  <snm>Brosius</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schmitz</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <fpage>7495</fpage>
            <lpage>7499</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1863447</pubid>
                  <pubid idtype="pmpid" link="fulltext">17452635</pubid>
                  <pubid idtype="doi">10.1073/pnas.0701289104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Rodent phylogeny and a timescale for the evolution of Glires: evidence from an extensive taxon sampling using three nuclear genes</p>
            </title>
            <aug>
               <au>
                  <snm>Huchon</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Madsen</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Sibbald</snm>
                  <fnm>MJJB</fnm>
               </au>
               <au>
                  <snm>Ament</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Stanhope</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Catzeflis</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>de Jong</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Douzery</snm>
                  <fnm>EJP</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <fpage>1053</fpage>
            <lpage>1065</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12082125</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Resolution of the early placental mammal radiation using Bayesian phylogenetics</p>
            </title>
            <aug>
               <au>
                  <snm>Murphy</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Eizirik</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>O'Brien</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Madsen</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Scally</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Douady</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Teeling</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ryder</snm>
                  <fnm>OA</fnm>
               </au>
               <au>
                  <snm>Stanhope</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>de Jong</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Springer</snm>
                  <fnm>MS</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>294</volume>
            <fpage>2348</fpage>
            <lpage>2351</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1067179</pubid>
                  <pubid idtype="pmpid" link="fulltext">11743200</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>A phylogenomic study of human, dog, and mouse</p>
            </title>
            <aug>
               <au>
                  <snm>Cannarozzi</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Schneider</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gonnet</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>PLoS Comput Biol</source>
            <pubdate>2007</pubdate>
            <volume>3</volume>
            <fpage>e2</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1761043</pubid>
                  <pubid idtype="pmpid" link="fulltext">17206860</pubid>
                  <pubid idtype="doi">10.1371/journal.pcbi.0030002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>The effect of branch lengths on phylogeny: An empirical study using highly conserved orthologs from mammalian genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Hughes</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Friedman</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Molecular Phylogenetics and Evolution</source>
            <pubdate>2007</pubdate>
            <volume>45</volume>
            <fpage>81</fpage>
            <lpage>88</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.ympev.2007.04.022</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Rates of genome evolution and branching order from whole genome analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Huttley</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Wakefield</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Easteal</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2007</pubdate>
            <volume>24</volume>
            <fpage>1722</fpage>
            <lpage>1730</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msm094</pubid>
                  <pubid idtype="pmpid" link="fulltext">17494028</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>A new phylogenetic marker, apolipoprotein B, provides compelling evidence for eutherian relationships</p>
            </title>
            <aug>
               <au>
                  <snm>Amrine-Madsen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Koepfli</snm>
                  <fnm>KP</fnm>
               </au>
               <au>
                  <snm>Wayne</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Springer</snm>
                  <fnm>MS</fnm>
               </au>
            </aug>
            <source>Molecular Phylogenetics and Evolution</source>
            <pubdate>2003</pubdate>
            <volume>28</volume>
            <fpage>225</fpage>
            <lpage>240</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/S1055-7903(03)00118-0</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Early history of mammals is elucidated with the ENCODE multiple species sequencing data</p>
            </title>
            <aug>
               <au>
                  <snm>Nikolaev</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Montoya-Burgos</snm>
                  <fnm>JI</fnm>
               </au>
               <au>
                  <snm>Margulies</snm>
                  <fnm>EH</fnm>
               </au>
               <au>
                  <snm>Rougemont</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Nyffeler</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Antonarakis</snm>
                  <fnm>SE</fnm>
               </au>
            </aug>
            <source>PLoS Genet</source>
            <pubdate>2007</pubdate>
            <volume>3</volume>
            <fpage>e2</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1761045</pubid>
                  <pubid idtype="pmpid" link="fulltext">17206863</pubid>
                  <pubid idtype="doi">10.1371/journal.pgen.0030002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Congruent mammalian trees from mitochondrial and nuclear genes using Bayesian methods</p>
            </title>
            <aug>
               <au>
                  <snm>Reyes</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gissi</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Catzeflis</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Nevo</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Pesole</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Saccone</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2004</pubdate>
            <volume>21</volume>
            <fpage>397</fpage>
            <lpage>403</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msh033</pubid>
                  <pubid idtype="pmpid" link="fulltext">14660685</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Sequence gaps join mice and men: phylogenetic evidence from deletions in two proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Poux</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>van Rheede</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Madsen</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>de Jong</snm>
                  <fnm>WW</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <fpage>2035</fpage>
            <lpage>2037</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12411613</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Retroposed elements as archives for the evolutionary history of placental mammals</p>
            </title>
            <aug>
               <au>
                  <snm>Kriegs</snm>
                  <fnm>JO</fnm>
               </au>
               <au>
                  <snm>Churakov</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Kiefmann</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jordan</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Brosius</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schmitz</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2006</pubdate>
            <volume>4</volume>
            <fpage>e91</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1395351</pubid>
                  <pubid idtype="pmpid" link="fulltext">16515367</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0040091</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Multigene analyses of bilaterian animals corroborate the monophyly of Ecdysozoa, Lophotrochozoa, and Protostomia</p>
            </title>
            <aug>
               <au>
                  <snm>Philippe</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lartillot</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Brinkmann</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <fpage>1246&#8211;1253</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/molbev/msi111</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Cases in which parsimony or compatibility methods will be positively misleading</p>
            </title>
            <aug>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Systematic Zoology</source>
            <pubdate>1978</pubdate>
            <volume>27</volume>
            <fpage>401</fpage>
            <lpage>410</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/2412923</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>The pitfalls of molecular phylogeny based on four species as illustrated by the Cetacea/Artiodactyla relationships</p>
            </title>
            <aug>
               <au>
                  <snm>Philippe</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Douzery</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>J Mammal Evol</source>
            <pubdate>1994</pubdate>
            <volume>2</volume>
            <fpage>133</fpage>
            <lpage>152</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/BF01464365</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Rabbits, if anything, are likely Glires</p>
            </title>
            <aug>
               <au>
                  <snm>Douzery</snm>
                  <fnm>EJP</fnm>
               </au>
               <au>
                  <snm>Huchon</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Molecular Phylogenetics and Evolution</source>
            <pubdate>2004</pubdate>
            <volume>33</volume>
            <fpage>922</fpage>
            <lpage>935</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.ympev.2004.07.014</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>Are Guinea pigs Rodents ? The importance of adequate models in molecular phylogenetics</p>
            </title>
            <aug>
               <au>
                  <snm>Sullivan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Swofford</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <source>J Mammal Evol</source>
            <pubdate>1997</pubdate>
            <volume>4</volume>
            <fpage>77</fpage>
            <lpage>86</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1023/A:1027314112438</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Dog as an outgroup to human and mouse</p>
            </title>
            <aug>
               <au>
                  <snm>Lunter</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>PLoS Comput Biol</source>
            <pubdate>2007</pubdate>
            <volume>3</volume>
            <fpage>e74</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1857806</pubid>
                  <pubid idtype="pmpid" link="fulltext">17465673</pubid>
                  <pubid idtype="doi">10.1371/journal.pcbi.0030074</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Toward a phylogenetic classification of the Mammalia</p>
            </title>
            <aug>
               <au>
                  <snm>McKenna</snm>
                  <fnm>MC</fnm>
               </au>
            </aug>
            <source>Phylogeny of the primates</source>
            <publisher>New York , Plenum Press</publisher>
            <editor>Luckett WP, Szalay FS</editor>
            <pubdate>1975</pubdate>
            <fpage>21</fpage>
            <lpage>46</lpage>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Using novel phylogenetic methods to evaluate mammalian mtDNA, including amino acid-invariant sites-LogDet plus site stripping, to detect internal conflicts in the data, with special reference to the positions of hedgehog, armadillo, and elephant</p>
            </title>
            <aug>
               <au>
                  <snm>Waddell</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Cao</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hauf</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hasegawa</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Systematic Biology</source>
            <pubdate>1999</pubdate>
            <volume>48</volume>
            <issue>1</issue>
            <fpage>31</fpage>
            <lpage>53</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/106351599260427</pubid>
                  <pubid idtype="pmpid">12078643</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Phylogenomic data analyses provide evidence that Xenarthra and Afrotheria are sister groups</p>
            </title>
            <aug>
               <au>
                  <snm>Hallstrom</snm>
                  <fnm>BM</fnm>
               </au>
               <au>
                  <snm>Kullberg</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nilsson</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Janke</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2007</pubdate>
            <volume>24</volume>
            <fpage>2059</fpage>
            <lpage>2068</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msm136</pubid>
                  <pubid idtype="pmpid" link="fulltext">17630282</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Genomics, biogeography, and the diversification of placental mammals</p>
            </title>
            <aug>
               <au>
                  <snm>Wildman</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Uddin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Opazo</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lefort</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Guindon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gascuel</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Grossman</snm>
                  <fnm>LI</fnm>
               </au>
               <au>
                  <snm>Romero</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Goodman</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <fpage>14395</fpage>
            <lpage>14400</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1958817</pubid>
                  <pubid idtype="pmpid" link="fulltext">17728403</pubid>
                  <pubid idtype="doi">10.1073/pnas.0704342104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>Rooting the eutherian tree: the power and pitfalls of phylogenomics</p>
            </title>
            <aug>
               <au>
                  <snm>Nishihara</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Okada</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hasegawa</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>R199</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2007-8-9-r199</pubid>
                  <pubid idtype="pmpid" link="fulltext">17883877</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
